Sparse semi-supervised heterogeneous interbattery bayesian analysis Articles
Overview
published in
- PATTERN RECOGNITION Journal
publication date
- December 2021
start page
- 1
end page
- 13
issue
- 108141
volume
- 120
Digital Object Identifier (DOI)
full text
International Standard Serial Number (ISSN)
- 0031-3203
Electronic International Standard Serial Number (EISSN)
- 1873-5142
abstract
-
The Bayesian approach to feature extraction, known as factor analysis (FA), has been widely studied in machine learning to obtain a latent representation of the data. An adequate selection of the probabilities and priors of these bayesian models allows the model to better adapt to the data nature (i.e. heterogeneity, sparsity), obtaining a more representative latent space.
The objective of this article is to propose a general FA framework capable of modelling any problem. To do so, we start from the Bayesian Inter-Battery Factor Analysis (BIBFA) model, enhancing it with new functionalities to be able to work with heterogeneous data, to include feature selection, and to handle missing values as well as semi-supervised problems.
The performance of the proposed model, Sparse Semi-supervised Heterogeneous Interbattery Bayesian Analysis (SSHIBA), has been tested on different scenarios to evaluate each one of its novelties, showing not only a great versatility and an interpretability gain, but also outperforming most of the state-of-the-art algorithms.
Classification
subjects
- Telecommunications
keywords
- bayesian model; canonical correlation analysis; principal component analysis; factor analysis; feature selection; semi-supervised; multi-task