Joint structured sparsity regularized multiview dimension reduction for video-based facial expression recognition

Xie, L; Tao, D; Wei, H

Joint structured sparsity regularized multiview dimension reduction for video-based facial expression recognition

Xie, L Tao, D

Wei, H

Permalink

Publication Type:: Journal Article
Citation:: ACM Transactions on Intelligent Systems and Technology, 2016, 8 (2)
Issue Date:: 2016-10-01

Closed Access

	Filename	Description	Size
	a28-xie.pdf	Published Version	1.44 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Xie, L	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.contributor.author	Wei, H	en_US
dc.date.issued	2016-10-01	en_US
dc.identifier.citation	ACM Transactions on Intelligent Systems and Technology, 2016, 8 (2)	en_US
dc.identifier.issn	2157-6904	en_US
dc.identifier.uri	http://hdl.handle.net/10453/122932
dc.description.abstract	© 2016 ACM. Video-based facial expression recognition (FER) has recently received increased attention as a result of its widespread application. Using only one type of feature to describe facial expression in video sequences is often inadequate, because the information available is very complex. With the emergence of different features to represent different properties of facial expressions in videos, an appropriate combination of these features becomes an important, yet challenging, problem. Considering that the dimensionality of these features is usually high, we thus introduce multiview dimension reduction (MVDR) into video-based FER. In MVDR, it is critical to explore the relationships between and within different feature views. To achieve this goal, we propose a novel framework of MVDR by enforcing joint structured sparsity at both inter- and intraview levels. In this way, correlations on and between the feature spaces of different views tend to be well-exploited. In addition, a transformation matrix is learned for each view to discover the patterns contained in the original features, so that the different views are comparable in finding a common representation. The model can be not only performed in an unsupervised manner, but also easily extended to a semisupervised setting by incorporating some domain knowledge. An alternating algorithm is developed for problem optimization, and each subproblem can be efficiently solved. Experiments on two challenging video-based FER datasets demonstrate the effectiveness of the proposed framework.	en_US
dc.relation	http://purl.org/au-research/grants/arc/DP140102164
dc.relation	http://purl.org/au-research/grants/arc/FT130101457
dc.relation	http://purl.org/au-research/grants/arc/LE140100061
dc.relation.ispartof	ACM Transactions on Intelligent Systems and Technology	en_US
dc.relation.isbasedon	10.1145/2956556	en_US
dc.title	Joint structured sparsity regularized multiview dimension reduction for video-based facial expression recognition	en_US
dc.type	Journal Article
utslib.citation.volume	2	en_US
utslib.citation.volume	8	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0806 Information Systems	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	2	en_US
pubs.publication-status	Published	en_US
pubs.volume	8	en_US

Abstract:

© 2016 ACM. Video-based facial expression recognition (FER) has recently received increased attention as a result of its widespread application. Using only one type of feature to describe facial expression in video sequences is often inadequate, because the information available is very complex. With the emergence of different features to represent different properties of facial expressions in videos, an appropriate combination of these features becomes an important, yet challenging, problem. Considering that the dimensionality of these features is usually high, we thus introduce multiview dimension reduction (MVDR) into video-based FER. In MVDR, it is critical to explore the relationships between and within different feature views. To achieve this goal, we propose a novel framework of MVDR by enforcing joint structured sparsity at both inter- and intraview levels. In this way, correlations on and between the feature spaces of different views tend to be well-exploited. In addition, a transformation matrix is learned for each view to discover the patterns contained in the original features, so that the different views are comparable in finding a common representation. The model can be not only performed in an unsupervised manner, but also easily extended to a semisupervised setting by incorporating some domain knowledge. An alternating algorithm is developed for problem optimization, and each subproblem can be efficiently solved. Experiments on two challenging video-based FER datasets demonstrate the effectiveness of the proposed framework.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/122932