Automatic image dataset construction with multiple textual metadata

Yao, Y; Zhang, J; Shen, F; Hua, X; Xu, J; Tang, Z

Automatic image dataset construction with multiple textual metadata

Yao, Y Zhang, J

Shen, F Hua, X Xu, J Tang, Z

Permalink

Publication Type:: Conference Proceeding
Citation:: Proceedings - IEEE International Conference on Multimedia and Expo, 2016, 2016-August
Issue Date:: 2016-08-25

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Accepted Manuscript versionAdobe PDF (490.38 kB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Yao, Y	en_US
dc.contributor.author	Zhang, J https://orcid.org/0000-0002-7240-3541	en_US
dc.contributor.author	Shen, F	en_US
dc.contributor.author	Hua, X	en_US
dc.contributor.author	Xu, J	en_US
dc.contributor.author	Tang, Z	en_US
dc.date.issued	2016-08-25	en_US
dc.identifier.citation	Proceedings - IEEE International Conference on Multimedia and Expo, 2016, 2016-August	en_US
dc.identifier.isbn	9781467372589	en_US
dc.identifier.issn	1945-7871	en_US
dc.identifier.uri	http://hdl.handle.net/10453/54468
dc.description.abstract	© 2016 IEEE. The goal of this work is to automatically collect a large number of highly relevant images from the Internet for given queries. A novel image dataset construction framework is proposed by employing multiple textual metadata. In specific, the given queries are first expanded by searching in the Google Books Ngrams Corpora to obtain a richer semantic description, from which the visually non-salient and less relevant expansions are then filtered. After retrieving images from the Internet with filtered expansions, we further filter noisy images by clustering and progressively Convolutional Neural Networks (CNN). To verify the effectiveness of our proposed method, we construct a dataset with 10 categories, which is not only much larger than but also have comparable cross-dataset generalization ability with manually labeled dataset STL-10 and CIFAR-10.	en_US
dc.relation.ispartof	Proceedings - IEEE International Conference on Multimedia and Expo	en_US
dc.relation.isbasedon	10.1109/ICME.2016.7552988	en_US
dc.title	Automatic image dataset construction with multiple textual metadata	en_US
dc.type	Conference Proceeding
utslib.citation.volume	2016-August	en_US
utslib.for	080101 Adaptive Agents and Intelligent Robotics	en_US
utslib.for	080110 Simulation and Modelling	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	2016-August	en_US

Abstract:

© 2016 IEEE. The goal of this work is to automatically collect a large number of highly relevant images from the Internet for given queries. A novel image dataset construction framework is proposed by employing multiple textual metadata. In specific, the given queries are first expanded by searching in the Google Books Ngrams Corpora to obtain a richer semantic description, from which the visually non-salient and less relevant expansions are then filtered. After retrieving images from the Internet with filtered expansions, we further filter noisy images by clustering and progressively Convolutional Neural Networks (CNN). To verify the effectiveness of our proposed method, we construct a dataset with 10 categories, which is not only much larger than but also have comparable cross-dataset generalization ability with manually labeled dataset STL-10 and CIFAR-10.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/54468