Incorporating the loss function into discriminative clustering of structured outputs

Zhong, W; Pan, W; Kwok, JT; Tsang, IW

Incorporating the loss function into discriminative clustering of structured outputs

Zhong, W Pan, W Kwok, JT Tsang, IW

Permalink

Publication Type:: Journal Article
Citation:: IEEE Transactions on Neural Networks, 2010, 21 (10), pp. 1564 - 1575
Issue Date:: 2010-10-01

Closed Access

	Filename	Description	Size
	2013004109OK.pdf		597.1 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Zhong, W	en_US
dc.contributor.author	Pan, W	en_US
dc.contributor.author	Kwok, JT	en_US
dc.contributor.author	Tsang, IW https://orcid.org/0000-0001-8095-4637	en_US
dc.date.issued	2010-10-01	en_US
dc.identifier.citation	IEEE Transactions on Neural Networks, 2010, 21 (10), pp. 1564 - 1575	en_US
dc.identifier.issn	1045-9227	en_US
dc.identifier.uri	http://hdl.handle.net/10453/28133
dc.description.abstract	Clustering using the Hilbert Schmidt independence criterion (CLUHSIC) is a recent clustering algorithm that maximizes the dependence between cluster labels and data observations according to the Hilbert Schmidt independence criterion (HSIC). It is unique in that structure information on the cluster outputs can be easily utilized in the clustering process. However, while the choice of the loss function is known to be very important in supervised learning with structured outputs, we will show in this paper that CLUHSIC is implicitly using the often inappropriate zero-one loss. We propose an extension called CLUHSICAL (which stands for Clustering using HSIC and loss) which explicitly considers both the output dependency and loss function. Its optimization problem has the same form as CLUHSIC, except that its partition matrix is constructed in a different manner. Experimental results on a number of datasets with structured outputs show that CLUHSICAL often outperforms CLUHSIC in terms of both structured loss and clustering accuracy. © 2010 IEEE.	en_US
dc.relation.ispartof	IEEE Transactions on Neural Networks	en_US
dc.relation.isbasedon	10.1109/TNN.2010.2064177	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.subject.mesh	Proteins	en_US
dc.subject.mesh	Facial Expression	en_US
dc.subject.mesh	Cluster Analysis	en_US
dc.subject.mesh	Algorithms	en_US
dc.subject.mesh	Neural Networks (Computer)	en_US
dc.subject.mesh	Databases, Factual	en_US
dc.subject.mesh	Neural Networks, Computer	en_US
dc.title	Incorporating the loss function into discriminative clustering of structured outputs	en_US
dc.type	Journal Article
utslib.citation.volume	10	en_US
utslib.citation.volume	21	en_US
utslib.for	020603 Quantum Information, Computation and Communication	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	closed_access
pubs.issue	10	en_US
pubs.publication-status	Published	en_US
pubs.volume	21	en_US

Abstract:

Clustering using the Hilbert Schmidt independence criterion (CLUHSIC) is a recent clustering algorithm that maximizes the dependence between cluster labels and data observations according to the Hilbert Schmidt independence criterion (HSIC). It is unique in that structure information on the cluster outputs can be easily utilized in the clustering process. However, while the choice of the loss function is known to be very important in supervised learning with structured outputs, we will show in this paper that CLUHSIC is implicitly using the often inappropriate zero-one loss. We propose an extension called CLUHSICAL (which stands for Clustering using HSIC and loss) which explicitly considers both the output dependency and loss function. Its optimization problem has the same form as CLUHSIC, except that its partition matrix is constructed in a different manner. Experimental results on a number of datasets with structured outputs show that CLUHSICAL often outperforms CLUHSIC in terms of both structured loss and clustering accuracy. © 2010 IEEE.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/28133