Combining Missing Data Imputation and Pattern Classification in a Multi-Layer Perceptron Articles uri icon

authors

  • SANCHO GOMEZ, JOSE LUIS
  • GARCÍA LAENCINA, PEDRO JOSÉ
  • FIGUEIRAS VIDAL, ANIBAL RAMON

publication date

  • November 2009

start page

  • 539

end page

  • 553

issue

  • 4

volume

  • 15

International Standard Serial Number (ISSN)

  • 1079-8587

Electronic International Standard Serial Number (EISSN)

  • 2326-005X

abstract

  • Multi-Layer Perceptions (MLPs) have been successfully applied in many pattern classification tasks. However, a drawback of these learning machines is that they cannot handle input vectors that present missing data on its features. A recommended way for dealing with missing values is imputation, i.e., to fill in missing data with plausible values. This paper presents a brief review of handling missing data, including the new Multi-Task Learning (MTL) systems. Moreover, an MLP approach for incomplete pattern classification based on MTL is proposed. This network learns in parallel the classification task (main task) and the different tasks associated to each incomplete feature (secondary tasks). During training, unknown values are imputed, being this missing data imputation process oriented by the learning of the classification task. Experimental results on five classification problems are given to show the effectiveness of the proposed approach