Classification of longitudinal data through a semiparametric mixed‐effects model based on lasso‐type estimators Articles uri icon

publication date

  • June 2015

start page

  • 333

end page

  • 343

issue

  • 2

volume

  • 71

international standard serial number (ISSN)

  • 0006-341X

electronic international standard serial number (EISSN)

  • 1541-0420

abstract

  • We propose a classification method for longitudinal data. The Bayes classifier is classically used to determine a classification rule where the underlying density in each class needs to be well modeled and estimated. This work is motivated by a real dataset of hormone levels measured at the early stages of pregnancy that can be used to predict normal versus abnormal pregnancy outcomes. The proposed model, which is a semiparametric linear mixed‐effects model (SLMM), is a particular case of the semiparametric nonlinear mixed‐effects class of models (SNMM) in which finite dimensional (fixed effects and variance components) and infinite dimensional (an unknown function) parameters have to be estimated. In SNMM's maximum likelihood estimation is performed iteratively alternating parametric and nonparametric procedures. However, if one can make the assumption that the random effects and the unknown function interact in a linear way, more efficient estimation methods can be used. Our contribution is the proposal of a unified estimation procedure based on a penalized EM‐type algorithm. The Expectation and Maximization steps are explicit. In this latter step, the unknown function is estimated in a nonparametric fashion using a lasso‐type procedure. A simulation study and an application on real data are performed.

keywords

  • classification; EM algorithm; lasso; longitudinal data; mixed-effects models; semiparametric estimation