Adversarial Action Data Augmentation for Similar Gesture Action Recognition

Wu, D; Chen, J; Sharma, N; Pan, S; Long, G; Blumenstein, M

Adversarial Action Data Augmentation for Similar Gesture Action Recognition

Wu, D Chen, J Sharma, N

Pan, S

Long, G

Blumenstein, M

Permalink

Publisher:: IEEE
Publication Type:: Conference Proceeding
Citation:: 2019 International Joint Conference on Neural Networks (IJCNN), 2019, 2019-July
Issue Date:: 2019-07-19

Closed Access

	Filename	Description	Size
	IJCNN 2019 - Di - Paper.pdf	Published version	3.07 MB		View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wu, D
dc.contributor.author	Chen, J
dc.contributor.author	Sharma, N https://orcid.org/0000-0003-0841-1245
dc.contributor.author	Pan, S https://orcid.org/0000-0003-0794-527X
dc.contributor.author	Long, G https://orcid.org/0000-0003-3740-9515
dc.contributor.author	Blumenstein, M https://orcid.org/0000-0002-9908-3744
dc.date	2019-07-14
dc.date.accessioned	2021-05-11T01:16:09Z
dc.date.available	2021-05-11T01:16:09Z
dc.date.issued	2019-07-19
dc.identifier.citation	2019 International Joint Conference on Neural Networks (IJCNN), 2019, 2019-July
dc.identifier.isbn	978-1-7281-1985-4
dc.identifier.issn	2161-4407
dc.identifier.uri	http://hdl.handle.net/10453/148829
dc.description.abstract	Human gestures are unique for recognizing and describing human actions, and video-based human action recognition techniques are effective solutions to varies real-world applications, such as surveillance, video indexing, and human-computer interaction. Most existing video human action recognition approaches either using handcraft features from the frames or deep learning models such as convolutional neural networks (CNN) and recurrent neural networks (RNN); however, they have mostly overlooked the similar gestures between different actions when processing the frames into the models. The classifiers suffer from similar features extracted from similar gestures, which are unable to classify the actions in the video streams. In this paper, we propose a novel framework with generative adversarial networks (GAN) to generate the data augmentation for similar gesture action recognition. The contribution of our work is tri-fold: 1) we proposed a novel action data augmentation framework (ADAF) to enlarge the differences between the actions with very similar gestures; 2) the framework can boost the classification performance either on similar gesture action pairs or the whole dataset; 3) experiments conducted on both KTH and UCF101 datasets show that our data augmentation framework boost the performance on both similar gestures actions as well as the whole dataset compared with baseline methods such as 2DCNN and 3DCNN.
dc.language	en
dc.publisher	IEEE
dc.relation.ispartof	2019 International Joint Conference on Neural Networks (IJCNN)
dc.relation.ispartof	International Joint Conference on Neural Networks
dc.relation.ispartofseries	IEEE International Joint Conference on Neural Networks (IJCNN)
dc.relation.isbasedon	10.1109/IJCNN.2019.8851993
dc.rights	info:eu-repo/semantics/closedAccess
dc.title	Adversarial Action Data Augmentation for Similar Gesture Action Recognition
dc.type	Conference Proceeding
utslib.citation.volume	2019-July
utslib.location.activity	Budapest, Hungary
utslib.for	0801 Artificial Intelligence and Image Processing
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Students
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney/Strength - QSI - Centre for Quantum Software and Information
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
utslib.copyright.status	closed_access	*
pubs.consider-herdc	false
dc.date.updated	2021-05-11T01:16:07Z
pubs.finish-date	2019-07-19
pubs.place-of-publication	Piscataway, USA
pubs.publication-status	Published
pubs.start-date	2019-07-14
pubs.volume	2019-July
dc.location	Piscataway, USA

Abstract:

Human gestures are unique for recognizing and describing human actions, and video-based human action recognition techniques are effective solutions to varies real-world applications, such as surveillance, video indexing, and human-computer interaction. Most existing video human action recognition approaches either using handcraft features from the frames or deep learning models such as convolutional neural networks (CNN) and recurrent neural networks (RNN); however, they have mostly overlooked the similar gestures between different actions when processing the frames into the models. The classifiers suffer from similar features extracted from similar gestures, which are unable to classify the actions in the video streams. In this paper, we propose a novel framework with generative adversarial networks (GAN) to generate the data augmentation for similar gesture action recognition. The contribution of our work is tri-fold: 1) we proposed a novel action data augmentation framework (ADAF) to enlarge the differences between the actions with very similar gestures; 2) the framework can boost the classification performance either on similar gesture action pairs or the whole dataset; 3) experiments conducted on both KTH and UCF101 datasets show that our data augmentation framework boost the performance on both similar gestures actions as well as the whole dataset compared with baseline methods such as 2DCNN and 3DCNN.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/148829