LDA-Based clustering algorithm and its application to an unsupervised feature extraction

Publication Type:
Journal Article
Citation:
IEEE Transactions on Fuzzy Systems, 2011, 19 (1), pp. 152 - 163
Issue Date:
2011-02-01
Full metadata record
Files in This Item:
Filename Description Size
05609198.pdfPublished Version1.42 MB
Adobe PDF
Research has shown fuzzy c-means (FCM) clustering to be a powerful tool to partition samples into different categories. However, the objective function of FCM is based only on the sum of distances of samples to their cluster centers, which is equal to the trace of the within-cluster scatter matrix. In this study, we propose a clustering algorithm based on both within- and between-cluster scatter matrices, extended from linear discriminant analysis (LDA), and its application to an unsupervised feature extraction (FE). Our proposed methods comprise between- and within-cluster scatter matrices modified from the between- and within-class scatter matrices of LDA. The scatter matrices of LDA are special cases of our proposed unsupervised scatter matrices. The results of experiments on both synthetic and real data show that the proposed clustering algorithm can generate similar or better clustering results than 11 popular clustering algorithms: K-means, K-medoid, FCM, the GustafsonKessel, GathGeva, possibilistic c-means (PCM), fuzzy PCM, possibilistic FCM, fuzzy compactness and separation, a fuzzy clustering algorithm based on a fuzzy treatment of finite mixtures of multivariate Students t distributions algorithms, and a fuzzy mixture of the Students t factor analyzers model. The results also show that the proposed FE outperforms principal component analysis and independent component analysis. © 2006 IEEE.
Please use this identifier to cite or link to this item: