Region-based Mixture Models for human action recognition in low-resolution videos

Publication Type:
Journal Article
Citation:
Neurocomputing, 2017, 247 pp. 1 - 15
Issue Date:
2017-07-19
Filename Description Size
1-s2.0-S0925231217305416-main.pdfPublished Version3.99 MB
Adobe PDF
Full metadata record
© 2017 State-of-the-art performance in human action recognition is achieved by the use of dense trajectories which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect in low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination. While, most existing methods encode the layout by previously segmenting body parts which is not feasible in LR videos. Addressing the problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); then we present a hybrid feature representation to integrate both of the shape and motion features; and finally we propose a Region-based Mixture Model (RMM) to be utilized for action classification. The RMM encodes the spatial layout of features without any needs of body parts segmentation. Experimental results show that the approach is effective and, more importantly, the approach is more general for LR recognition tasks.
Please use this identifier to cite or link to this item: