Convex and scalable weakly labeled SVMs

Li, YF; Tsang, IW; Kwok, JT; Zhou, ZH

Convex and scalable weakly labeled SVMs

Li, YF Tsang, IW

Kwok, JT Zhou, ZH

Permalink

Publication Type:: Journal Article
Citation:: Journal of Machine Learning Research, 2013, 14 pp. 2151 - 2188
Issue Date:: 2013-06-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download full textAdobe PDF (1 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Li, YF	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.contributor.author	Kwok, JT	en_US
dc.contributor.author	Zhou, ZH	en_US
dc.date.issued	2013-06-01	en_US
dc.identifier.citation	Journal of Machine Learning Research, 2013, 14 pp. 2151 - 2188	en_US
dc.identifier.issn	1532-4435	en_US
dc.identifier.uri	http://hdl.handle.net/10453/30643
dc.identifier.uri	http://hdl.handle.net/10453/29385
dc.description.abstract	In this paper, we study the problem of learning from weakly labeled data, where labels of the training examples are incomplete. This includes, for example, (i) semi-supervised learning where labels are partially known; (ii) multi-instance learning where labels are implicitly known; and (iii) clustering where labels are completely unknown. Unlike supervised learning, learning with weak labels involves a difficult Mixed-Integer Programming (MIP) problem. Therefore, it can suffer from poor scalability and may also get stuck in local minimum. In this paper, we focus on SVMs and propose the WELLSVM via a novel label generation strategy. This leads to a convex relaxation of the original MIP, which is at least as tight as existing convex Semi-Definite Programming (SDP) relaxations. Moreover, the WELLSVM can be solved via a sequence of SVM subproblems that are much more scalable than previous convex SDP relaxations. Experiments on three weakly labeled learning tasks, namely, (i) semi-supervised learning; (ii) multi-instance learning for locating regions of interest in content-based information retrieval; and (iii) clustering, clearly demonstrate improved performance, and WELLSVM is also readily applicable on large data sets. © 2013 Yu-Feng Li, Ivor Tsang, James Kwok and Zhi-Hua Zhou.	en_US
dc.relation.ispartof	Journal of Machine Learning Research	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Convex and scalable weakly labeled SVMs	en_US
dc.type	Journal Article
utslib.citation.volume	14	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	open_access
pubs.publication-status	Published	en_US
pubs.volume	14	en_US

Abstract:

In this paper, we study the problem of learning from weakly labeled data, where labels of the training examples are incomplete. This includes, for example, (i) semi-supervised learning where labels are partially known; (ii) multi-instance learning where labels are implicitly known; and (iii) clustering where labels are completely unknown. Unlike supervised learning, learning with weak labels involves a difficult Mixed-Integer Programming (MIP) problem. Therefore, it can suffer from poor scalability and may also get stuck in local minimum. In this paper, we focus on SVMs and propose the WELLSVM via a novel label generation strategy. This leads to a convex relaxation of the original MIP, which is at least as tight as existing convex Semi-Definite Programming (SDP) relaxations. Moreover, the WELLSVM can be solved via a sequence of SVM subproblems that are much more scalable than previous convex SDP relaxations. Experiments on three weakly labeled learning tasks, namely, (i) semi-supervised learning; (ii) multi-instance learning for locating regions of interest in content-based information retrieval; and (iii) clustering, clearly demonstrate improved performance, and WELLSVM is also readily applicable on large data sets. © 2013 Yu-Feng Li, Ivor Tsang, James Kwok and Zhi-Hua Zhou.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/29385