Compact and Discriminative Descriptor Inference Using Multi-Cues

Publication Type:
Journal Article
IEEE Transactions on Image Processing, 2015, 24 (12), pp. 5114 - 5126
Issue Date:
Filename Description Size
c.pdfPublished Version2.13 MB
Adobe PDF
Full metadata record
© 1992-2012 IEEE. Feature descriptors around local interest points are widely used in human action recognition both for images and videos. However, each kind of descriptors describes the local characteristics around the reference point only from one cue. To enhance the descriptive and discriminative ability from multiple cues, this paper proposes a descriptor learning framework to optimize the descriptors at the source by learning a projection from multiple descriptors' spaces to a new Euclidean space. In this space, multiple cues and characteristics of different descriptors are fused and complemented for each other. In order to make the new descriptor more discriminative, we learn the multi-cue projection by the minimization of the ratio of within-class scatter to between-class scatter, and therefore, the discriminative ability of the projected descriptor is enhanced. In the experiment, we evaluate our framework on the tasks of action recognition from still images and videos. Experimental results on two benchmark image and two benchmark video data sets demonstrate the effectiveness and better performance of our method.
Please use this identifier to cite or link to this item: