Electronic International Standard Serial Number (EISSN)
1872-7409
abstract
A principled methodology for solving imbalanced binary classification problems has been recently introduced. It permits to obtain high performance designs avoiding the risks of degradation that other procedures suffer from. The corresponding paper BenÃtez-Buenache et al. (2019) shows evidence of these facts by applying direct versions, using just one of the possible rebalancing techniques and applying full rebalancing. In this contribution, we extend the above study for maximizing the performance of the resulting designs. To this end, we combine principled techniques in order to taking benefit from their different characteristics. The combination weights as well as the rebalance degree are selected by means of a simple (cross-validation) search. A number of experiments with different kinds of databases shows significant performance improvements. At the same time, the database characteristics that limit the performance improvements -such as small size and noisy samples- are detected.