Local depth patterns for fine-grained activity recognition in depth videos

Awwad, S; Piccardi, M

Publication Type: Conference Proceeding
Citation: International Conference Image and Vision Computing New Zealand, 2016, 0
Issue Date: 2016-07-02

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Awwad, S	en_US
dc.contributor.author	Piccardi, M https://orcid.org/0000-0001-9250-6604	en_US
dc.date.issued	2016-07-02	en_US
dc.identifier.citation	International Conference Image and Vision Computing New Zealand, 2016, 0	en_US
dc.identifier.isbn	9781509027484	en_US
dc.identifier.issn	2151-2191	en_US
dc.identifier.uri	http://hdl.handle.net/10453/80091
dc.description.abstract	© 2016 IEEE. Fine-grained activities are human activities involving small objects and small movements. Automatic recognition of such activities can prove useful for many applications, including detailed diarization of meetings and training sessions, assistive human-computer interaction and robotics interfaces. Existing approaches to fine-grained activity recognition typically leverage the combined use of multiple sensors including cameras, RFID tags, gyroscopes and accelerometers borne by the monitored people and target objects. Although effective, the downside of these solutions is that they require minute instrumentation of the environment that is intrusive and hard to scale. To this end, this paper investigates fine-grained activity recognition in a kitchen setting by solely using a depth camera. The primary contribution of this work is an aggregated depth descriptor that effectively captures the shape of the objects and the actors. Experimental results over the challenging '50 Salads' dataset of kitchen activities show an accuracy comparable to that of a state-of-the-art approach based on multiple sensors, thereby validating a less intrusive and more practical way of monitoring fine-grained activities.	en_US
dc.relation.ispartof	International Conference Image and Vision Computing New Zealand	en_US
dc.relation.isbasedon	10.1109/IVCNZ.2016.7804453	en_US
dc.title	Local depth patterns for fine-grained activity recognition in depth videos	en_US
dc.type	Conference Proceeding
utslib.citation.volume	0	en_US
utslib.for	080104 Computer Vision	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	0	en_US

Abstract:

© 2016 IEEE. Fine-grained activities are human activities involving small objects and small movements. Automatic recognition of such activities can prove useful for many applications, including detailed diarization of meetings and training sessions, assistive human-computer interaction and robotics interfaces. Existing approaches to fine-grained activity recognition typically leverage the combined use of multiple sensors including cameras, RFID tags, gyroscopes and accelerometers borne by the monitored people and target objects. Although effective, the downside of these solutions is that they require minute instrumentation of the environment that is intrusive and hard to scale. To this end, this paper investigates fine-grained activity recognition in a kitchen setting by solely using a depth camera. The primary contribution of this work is an aggregated depth descriptor that effectively captures the shape of the objects and the actors. Experimental results over the challenging '50 Salads' dataset of kitchen activities show an accuracy comparable to that of a state-of-the-art approach based on multiple sensors, thereby validating a less intrusive and more practical way of monitoring fine-grained activities.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/80091