On the design of Bayesian principled algorithms for imbalanced classification Articles uri icon

publication date

  • June 2021

start page

  • 1

end page

  • 9

issue

  • 106969

volume

  • 221

International Standard Serial Number (ISSN)

  • 0950-7051

Electronic International Standard Serial Number (EISSN)

  • 1872-7409

abstract

  • A principled methodology for solving imbalanced binary classification problems has been recently introduced. It permits to obtain high performance designs avoiding the risks of degradation that other procedures suffer from. The corresponding paper Benítez-Buenache et al. (2019) shows evidence of these facts by applying direct versions, using just one of the possible rebalancing techniques and
    applying full rebalancing.
    In this contribution, we extend the above study for maximizing the performance of the resulting designs. To this end, we combine principled techniques in order to taking benefit from their different characteristics. The combination weights as well as the rebalance degree are selected by means of a simple (cross-validation) search. A number of experiments with different kinds of databases shows significant performance improvements. At the same time, the database characteristics that limit the performance improvements -such as small size and noisy samples- are detected.

subjects

  • Psychology

keywords

  • bregman divergences; combined techniques; imbalance; parameter selection; principled rebalance