Combining Entropy Measures for Anomaly Detection

publication date

  • September 2018

issue

  • 9

volume

  • 20

international standard serial number (ISSN)

  • 1099-4300

abstract

  • The combination of different sources of information is a problem that arises in several situations, for instance, when data are analysed using different similarity measures. Often, each source of information is given as a similarity, distance, or a kernel matrix. In this paper, we propose a new class of methods which consists of producing, for anomaly detection purposes, a single Mercer kernel (that acts as a similarity measure) from a set of local entropy kernels and, at the same time, avoids the task of model selection. This kernel is used to build an embedding of data in a variety that will allow the use of a (modified) one-class Support Vector Machine to detect outliers. We study several information combination schemes and their limiting behaviour when the data sample size increases within an Information Geometry context. In particular, we study the variety of the given positive definite kernel matrices to obtain the desired kernel combination as belonging to that variety. The proposed methodology has been evaluated on several real and artificial problems.

keywords

  • entropy kernel; kernel combination; Karcher mean; anomaly detection; functional data; support vector machines; depth; distributions; matrices