Nonsmooth Penalized Clustering via ℓ<inf>p</inf> Regularized Sparse Regression

Niu, L; Zhou, R; Tian, Y; Qi, Z; Zhang, P

Nonsmooth Penalized Clustering via ℓ<inf>p</inf> Regularized Sparse Regression

Niu, L Zhou, R Tian, Y Qi, Z Zhang, P

Permalink

Publication Type:: Journal Article
Citation:: IEEE Transactions on Cybernetics, 2017, 47 (6), pp. 1423 - 1433
Issue Date:: 2017-06-01

Closed Access

	Filename	Description	Size
	07460120.pdf	Published Version	1.92 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Niu, L	en_US
dc.contributor.author	Zhou, R	en_US
dc.contributor.author	Tian, Y	en_US
dc.contributor.author	Qi, Z	en_US
dc.contributor.author	Zhang, P https://orcid.org/0000-0001-7973-2746	en_US
dc.date.issued	2017-06-01	en_US
dc.identifier.citation	IEEE Transactions on Cybernetics, 2017, 47 (6), pp. 1423 - 1433	en_US
dc.identifier.issn	2168-2267	en_US
dc.identifier.uri	http://hdl.handle.net/10453/126985
dc.description.abstract	© 2016 IEEE. Clustering has been widely used in data analysis. A majority of existing clustering approaches assume that the number of clusters is given in advance. Recently, a novel clustering framework is proposed which can automatically learn the number of clusters from training data. Based on these works, we propose a nonsmooth penalized clustering model via ℓp (0 < p < 1) regularized sparse regression. In particular, this model is formulated as a nonsmooth nonconvex optimization, which is based on over-parameterization and utilizes an ℓp-norm-based regularization to control the tradeoff between the model fit and the number of clusters. We theoretically prove that the new model can guarantee the sparseness of cluster centers. To increase its practicality for practical use, we adhere to an easy-to-compute criterion and follow a strategy to narrow down the search interval of cross validation. To address the nonsmoothness and nonconvexness of the cost function, we propose a simple smoothing trust region algorithm and present its convergent and computational complexity analysis. Numerical studies on both simulated and practical data sets provide support to our theoretical results and demonstrate the advantages of our new method.	en_US
dc.relation.ispartof	IEEE Transactions on Cybernetics	en_US
dc.relation.isbasedon	10.1109/TCYB.2016.2546965	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	Nonsmooth Penalized Clustering via ℓ<inf>p</inf> Regularized Sparse Regression	en_US
dc.type	Journal Article
utslib.citation.volume	6	en_US
utslib.citation.volume	47	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0102 Applied Mathematics	en_US
utslib.for	0906 Electrical and Electronic Engineering	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
utslib.copyright.status	closed_access
pubs.issue	6	en_US
pubs.publication-status	Published	en_US
pubs.volume	47	en_US

Abstract:

© 2016 IEEE. Clustering has been widely used in data analysis. A majority of existing clustering approaches assume that the number of clusters is given in advance. Recently, a novel clustering framework is proposed which can automatically learn the number of clusters from training data. Based on these works, we propose a nonsmooth penalized clustering model via ℓp (0 < p < 1) regularized sparse regression. In particular, this model is formulated as a nonsmooth nonconvex optimization, which is based on over-parameterization and utilizes an ℓp-norm-based regularization to control the tradeoff between the model fit and the number of clusters. We theoretically prove that the new model can guarantee the sparseness of cluster centers. To increase its practicality for practical use, we adhere to an easy-to-compute criterion and follow a strategy to narrow down the search interval of cross validation. To address the nonsmoothness and nonconvexness of the cost function, we propose a simple smoothing trust region algorithm and present its convergent and computational complexity analysis. Numerical studies on both simulated and practical data sets provide support to our theoretical results and demonstrate the advantages of our new method.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/126985