Maximum margin clustering made practical

Zhang, K; Tsang, IW; Kwok, JT

Maximum margin clustering made practical

Zhang, K Tsang, IW

Kwok, JT

Permalink

Publication Type:: Journal Article
Citation:: IEEE Transactions on Neural Networks, 2009, 20 (4), pp. 583 - 596
Issue Date:: 2009-03-10

Closed Access

	Filename	Description	Size
	2013004110OK.pdf		1.37 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhang, K	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.contributor.author	Kwok, JT	en_US
dc.date.issued	2009-03-10	en_US
dc.identifier.citation	IEEE Transactions on Neural Networks, 2009, 20 (4), pp. 583 - 596	en_US
dc.identifier.issn	1045-9227	en_US
dc.identifier.uri	http://hdl.handle.net/10453/29715
dc.description.abstract	Motivated by the success of large margin methods in supervised learning, maximum margin clustering (MMC) is a recent approach that aims at extending large margin methods to unsupervised learning. However, its optimization problem is nonconvex and existing MMC methods all rely on reformulating and relaxing the nonconvex optimization problem as semidefinite programs (SDP). Though SDP is convex and standard solvers are available, they are computationally very expensive and only small data sets can be handled. To make MMC more practical, we avoid SDP relaxations and propose in this paper an efficient approach that performs alternating optimization directly on the original nonconvex problem. A key step to avoid premature convergence in the resultant iterative procedure is to change the loss function from the hinge loss to the Laplacian/square loss so that overconfident predictions are penalized. Experiments on a number of synthetic and real-world data sets demonstrate that the proposed approach is more accurate, much faster (hundreds to tens of thousands of times faster), and can handle data sets that are hundreds of times larger than the largest data set reported in the MMC literature. © 2009 IEEE.	en_US
dc.relation.ispartof	IEEE Transactions on Neural Networks	en_US
dc.relation.isbasedon	10.1109/TNN.2008.2010620	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Maximum margin clustering made practical	en_US
dc.type	Journal Article
utslib.citation.volume	4	en_US
utslib.citation.volume	20	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.issue	4	en_US
pubs.publication-status	Published	en_US
pubs.volume	20	en_US

Abstract:

Motivated by the success of large margin methods in supervised learning, maximum margin clustering (MMC) is a recent approach that aims at extending large margin methods to unsupervised learning. However, its optimization problem is nonconvex and existing MMC methods all rely on reformulating and relaxing the nonconvex optimization problem as semidefinite programs (SDP). Though SDP is convex and standard solvers are available, they are computationally very expensive and only small data sets can be handled. To make MMC more practical, we avoid SDP relaxations and propose in this paper an efficient approach that performs alternating optimization directly on the original nonconvex problem. A key step to avoid premature convergence in the resultant iterative procedure is to change the loss function from the hinge loss to the Laplacian/square loss so that overconfident predictions are penalized. Experiments on a number of synthetic and real-world data sets demonstrate that the proposed approach is more accurate, much faster (hundreds to tens of thousands of times faster), and can handle data sets that are hundreds of times larger than the largest data set reported in the MMC literature. © 2009 IEEE.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/29715