Deep semisupervised zero-shot learning with maximum mean discrepancy

Zhang, L; Liu, J; Luo, M; Chang, X; Zheng, Q

Publication Type: Journal Article
Citation: Neural Computation, 2018, 30 (5), pp. 1426-1447
Issue Date: 2018-05-01

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, L	en_US
dc.contributor.author	Liu, J	en_US
dc.contributor.author	Luo, M	en_US
dc.contributor.author	Chang, X https://orcid.org/0000-0002-7778-8807	en_US
dc.contributor.author	Zheng, Q	en_US
dc.date.issued	2018-05-01	en_US
dc.identifier.citation	Neural Computation, 2018, 30 (5), pp. 1426 - 1447	en_US
dc.identifier.issn	0899-7667	en_US
dc.identifier.uri	http://hdl.handle.net/10453/134324
dc.description.abstract	© 2018 Massachusetts Institute of Technology. Because it is difficult to collect labeled images for hundreds of thousands of visual categories, zero-shot learning, where unseen categories have no labeled images in the training stage, has attracted increasing attention. In the past, many studies focused on transferring knowledge from seen to unseen categories by projecting all category labels into a semantic space. However, the label embeddings could not adequately express the semantics of categories. Furthermore, the common semantics of seen and unseen instances cannot be captured accurately because the distributions of these instances may be quite different. To address these issues, we propose a novel deep semisupervised method that jointly considers the heterogeneity gap between different modalities and the correlation among unimodal instances. This method replaces the original labels with the corresponding textual descriptions to better capture the category semantics. It also overcomes the problem of distribution difference by minimizing the maximum mean discrepancy between seen and unseen instance distributions. Extensive experimental results on two benchmark data sets, CU200-Birds and Oxford Flowers-102, indicate that our method achieves significant improvements over previous methods.	en_US
dc.relation.ispartof	Neural Computation	en_US
dc.relation.isbasedon	10.1162/neco_a_01071	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Deep semisupervised zero-shot learning with maximum mean discrepancy	en_US
dc.type	Journal Article
utslib.citation.volume	5	en_US
utslib.citation.volume	30	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Students
utslib.copyright.status	open_access
pubs.issue	5	en_US
pubs.publication-status	Published	en_US
pubs.volume	30	en_US

Abstract:

© 2018 Massachusetts Institute of Technology. Because it is difficult to collect labeled images for hundreds of thousands of visual categories, zero-shot learning, where unseen categories have no labeled images in the training stage, has attracted increasing attention. In the past, many studies focused on transferring knowledge from seen to unseen categories by projecting all category labels into a semantic space. However, the label embeddings could not adequately express the semantics of categories. Furthermore, the common semantics of seen and unseen instances cannot be captured accurately because the distributions of these instances may be quite different. To address these issues, we propose a novel deep semisupervised method that jointly considers the heterogeneity gap between different modalities and the correlation among unimodal instances. This method replaces the original labels with the corresponding textual descriptions to better capture the category semantics. It also overcomes the problem of distribution difference by minimizing the maximum mean discrepancy between seen and unseen instance distributions. Extensive experimental results on two benchmark data sets, CU200-Birds and Oxford Flowers-102, indicate that our method achieves significant improvements over previous methods.
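
To make the quantity named in the abstract concrete, the following is a minimal sketch of a squared maximum mean discrepancy (MMD) estimate between two sets of feature vectors. It is an illustration only, not the paper's implementation: the Gaussian (RBF) kernel, the `bandwidth` parameter, the biased estimator, and all function and variable names are assumptions, since the abstract does not specify the kernel or the feature space used.

```python
import math


def rbf_kernel(a, b, bandwidth=1.0):
    """Gaussian (RBF) kernel between two equal-length feature vectors.

    The kernel choice and bandwidth are assumptions for this sketch.
    """
    sq_dist = sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    return math.exp(-sq_dist / (2.0 * bandwidth ** 2))


def mean_kernel(xs, ys, bandwidth=1.0):
    """Average kernel value over all pairs (x, y) with x in xs and y in ys."""
    total = sum(rbf_kernel(x, y, bandwidth) for x in xs for y in ys)
    return total / (len(xs) * len(ys))


def mmd_squared(seen, unseen, bandwidth=1.0):
    """Biased estimate of squared MMD between two samples.

    MMD^2 = E[k(s, s')] + E[k(u, u')] - 2 * E[k(s, u)],
    with each expectation replaced by an average over all sample pairs.
    """
    return (
        mean_kernel(seen, seen, bandwidth)
        + mean_kernel(unseen, unseen, bandwidth)
        - 2.0 * mean_kernel(seen, unseen, bandwidth)
    )


# Hypothetical toy features: one "seen" sample, one sample drawn from a
# similar region, and one sample drawn from a clearly shifted region.
seen_features = [[0.0, 0.1], [0.2, -0.1], [-0.1, 0.0]]
near_features = [[0.1, 0.0], [0.0, 0.2], [-0.2, -0.1]]
far_features = [[3.0, 3.1], [2.9, 3.2], [3.1, 2.8]]

mmd_near = mmd_squared(seen_features, near_features)
mmd_far = mmd_squared(seen_features, far_features)
print("MMD^2 (similar distributions):", mmd_near)
print("MMD^2 (shifted distributions):", mmd_far)
```

In this toy setting the estimate is small when the two samples overlap and grows when one sample is shifted away from the other. A training objective that minimizes such a term would, under these assumptions, push the seen and unseen feature distributions toward each other.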

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/134324