Multi-Modal Curriculum Learning for Semi-Supervised Image Classification

Gong, C; Tao, D; Maybank, SJ; Liu, W; Kang, G; Yang, J

Multi-Modal Curriculum Learning for Semi-Supervised Image Classification

Gong, C Tao, D

Maybank, SJ Liu, W Kang, G Yang, J

Permalink

Publication Type:: Journal Article
Citation:: IEEE Transactions on Image Processing, 2016, 25 (7), pp. 3249 - 3260
Issue Date:: 2016-07-01

Closed Access

	Filename	Description	Size
	07465792.pdf	Published Version	3 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Gong, C	en_US
dc.contributor.author	Tao, D https://orcid.org/0000-0001-7225-5449	en_US
dc.contributor.author	Maybank, SJ	en_US
dc.contributor.author	Liu, W	en_US
dc.contributor.author	Kang, G	en_US
dc.contributor.author	Yang, J	en_US
dc.date.issued	2016-07-01	en_US
dc.identifier.citation	IEEE Transactions on Image Processing, 2016, 25 (7), pp. 3249 - 3260	en_US
dc.identifier.issn	1057-7149	en_US
dc.identifier.uri	http://hdl.handle.net/10453/122761
dc.description.abstract	© 1992-2012 IEEE. Semi-supervised image classification aims to classify a large quantity of unlabeled images by typically harnessing scarce labeled images. Existing semi-supervised methods often suffer from inadequate classification accuracy when encountering difficult yet critical images, such as outliers, because they treat all unlabeled images equally and conduct classifications in an imperfectly ordered sequence. In this paper, we employ the curriculum learning methodology by investigating the difficulty of classifying every unlabeled image. The reliability and the discriminability of these unlabeled images are particularly investigated for evaluating their difficulty. As a result, an optimized image sequence is generated during the iterative propagations, and the unlabeled images are logically classified from simple to difficult. Furthermore, since images are usually characterized by multiple visual feature descriptors, we associate each kind of features with a teacher, and design a multi-modal curriculum learning (MMCL) strategy to integrate the information from different feature modalities. In each propagation, each teacher analyzes the difficulties of the currently unlabeled images from its own modality viewpoint. A consensus is subsequently reached among all the teachers, determining the currently simplest images (i.e., a curriculum), which are to be reliably classified by the multi-modal learner. This well-organized propagation process leveraging multiple teachers and one learner enables our MMCL to outperform five state-of-the-art methods on eight popular image data sets.	en_US
dc.relation	http://purl.org/au-research/grants/arc/DP140102164
dc.relation	http://purl.org/au-research/grants/arc/FT130101457
dc.relation	http://purl.org/au-research/grants/arc/LP140100569
dc.relation	http://purl.org/au-research/grants/arc/LE140100061
dc.relation.ispartof	IEEE Transactions on Image Processing	en_US
dc.relation.isbasedon	10.1109/TIP.2016.2563981	en_US
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Multi-Modal Curriculum Learning for Semi-Supervised Image Classification	en_US
dc.type	Journal Article
utslib.citation.volume	7	en_US
utslib.citation.volume	25	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0906 Electrical and Electronic Engineering	en_US
utslib.for	1702 Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	closed_access	*
pubs.issue	7	en_US
pubs.publication-status	Published	en_US
pubs.volume	25	en_US

Abstract:

© 1992-2012 IEEE. Semi-supervised image classification aims to classify a large quantity of unlabeled images by typically harnessing scarce labeled images. Existing semi-supervised methods often suffer from inadequate classification accuracy when encountering difficult yet critical images, such as outliers, because they treat all unlabeled images equally and conduct classifications in an imperfectly ordered sequence. In this paper, we employ the curriculum learning methodology by investigating the difficulty of classifying every unlabeled image. The reliability and the discriminability of these unlabeled images are particularly investigated for evaluating their difficulty. As a result, an optimized image sequence is generated during the iterative propagations, and the unlabeled images are logically classified from simple to difficult. Furthermore, since images are usually characterized by multiple visual feature descriptors, we associate each kind of features with a teacher, and design a multi-modal curriculum learning (MMCL) strategy to integrate the information from different feature modalities. In each propagation, each teacher analyzes the difficulties of the currently unlabeled images from its own modality viewpoint. A consensus is subsequently reached among all the teachers, determining the currently simplest images (i.e., a curriculum), which are to be reliably classified by the multi-modal learner. This well-organized propagation process leveraging multiple teachers and one learner enables our MMCL to outperform five state-of-the-art methods on eight popular image data sets.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/122761