Electronic International Standard Serial Number (EISSN)
1433-755X
abstract
This paper presents a fast training strategy for the Viola&-Jones (VJ) type object-detection systems. The VJ object- detection system, popular for its high accuracy at real-time testing speeds, has a drawback that it is slow to train. A face detector, for example, can take days to train. In content-based image retrieval (CBIR), where search needs to be performed instantaneously, VJ's long training time is not affordable. Therefore, VJ's method is hardly used for such applications. This paper proposes two modifications to the training algorithm of VJ-type object detection systems which reduces the training time to the order of seconds. Firstly, Laplacian clutter (non-object) models are used to train the weak classifier, thus eliminating the need to read and evaluate thousands of clutter images. Secondly, the training procedure is simplified by removing the time-consuming AdaBoost-based feature selection procedure. An object detector, trained with 500 images, approximately takes 2 s for training in a conventional 3 GHz machine. Our results show that the accuracy of the detector, built with the proposed approach, is inferior to that of VJ for difficult object class such as frontal faces. However, for objects with lesser degree of intra-class variations such as hearts, state-of-the-art accuracy can be obtained. Importantly, for CBIR applications, the fast testing speed of the VJ type object detector is maintained.