Region-based Mixture Models for human action recognition in low-resolution videos

Zhao, Y; Di, H; Zhang, J; Lu, Y; Lv, F; Li, Y

Publication Type: Journal Article
Citation: Neurocomputing, 2017, 247 pp. 1 - 15
Issue Date: 2017-07-19

Closed Access

	Filename	Description	Size	Format
	1-s2.0-S0925231217305416-main.pdf	Published Version	3.99 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhao, Y	en_US
dc.contributor.author	Di, H	en_US
dc.contributor.author	Zhang, J https://orcid.org/0000-0002-7240-3541	en_US
dc.contributor.author	Lu, Y	en_US
dc.contributor.author	Lv, F	en_US
dc.contributor.author	Li, Y	en_US
dc.date.issued	2017-07-19	en_US
dc.identifier.citation	Neurocomputing, 2017, 247 pp. 1 - 15	en_US
dc.identifier.issn	0925-2312	en_US
dc.identifier.uri	http://hdl.handle.net/10453/97666
dc.description.abstract	© 2017 State-of-the-art performance in human action recognition is achieved with dense trajectories, which are extracted by optical flow algorithms. However, optical flow algorithms are far from perfect on low-resolution (LR) videos. In addition, the spatial and temporal layout of features is a powerful cue for action discrimination, yet most existing methods encode this layout by first segmenting body parts, which is not feasible in LR videos. To address these problems, we adopt the Layered Elastic Motion Tracking (LEMT) method to extract a set of long-term motion trajectories and a long-term common shape from each video sequence, where the extracted trajectories are much denser than those of sparse interest points (SIPs); we then present a hybrid feature representation that integrates both shape and motion features; and finally we propose a Region-based Mixture Model (RMM) for action classification. The RMM encodes the spatial layout of features without requiring body-part segmentation. Experimental results show that the approach is effective and, more importantly, more general for LR recognition tasks.	en_US
dc.relation.ispartof	Neurocomputing	en_US
dc.relation.isbasedon	10.1016/j.neucom.2017.03.033	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Region-based Mixture Models for human action recognition in low-resolution videos	en_US
dc.type	Journal Article
utslib.citation.volume	247	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	09 Engineering	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	247	en_US

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/97666