A fast and effective multiple kernel clustering method on incomplete data

Xiang, L; Zhao, G; Li, Q; Kim, GJ; Alfarraj, O; Tolba, A

A fast and effective multiple kernel clustering method on incomplete data

Xiang, L Zhao, G Li, Q

Kim, GJ Alfarraj, O Tolba, A

Permalink

Publisher:: Computers, Materials and Continua (Tech Science Press)
Publication Type:: Journal Article
Citation:: Computers, Materials and Continua, 2021, 67, (1), pp. 267-284
Issue Date:: 2021-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (4.41 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Xiang, L
dc.contributor.author	Zhao, G
dc.contributor.author	Li, Q https://orcid.org/0000-0002-8308-9551
dc.contributor.author	Kim, GJ
dc.contributor.author	Alfarraj, O
dc.contributor.author	Tolba, A
dc.date.accessioned	2022-07-02T21:28:06Z
dc.date.available	2022-07-02T21:28:06Z
dc.date.issued	2021-01-01
dc.identifier.citation	Computers, Materials and Continua, 2021, 67, (1), pp. 267-284
dc.identifier.issn	1546-2218
dc.identifier.issn	1546-2226
dc.identifier.uri	http://hdl.handle.net/10453/158533
dc.description.abstract	Multiple kernel clustering is an unsupervised data analysis method that has been used in various scenarios where data is easy to be collected but hard to be labeled. However, multiple kernel clustering for incomplete data is a critical yet challenging task. Although the existing absent multiple kernel clustering methods have achieved remarkable performance on this task, they may fail when data has a high value-missing rate, and they may easily fall into a local optimum. To address these problems, in this paper, we propose an absent multiple kernel clustering (AMKC) method on incomplete data. The AMKC method first clusters the initialized incomplete data. Then, it constructs a new multiple-kernel-based data space, referred to as K-space, from multiple sources to learn kernel combination coefficients. Finally, it seamlessly integrates an incomplete-kernel-imputation objective, a multiple-kernel-learning objective, and a kernel-clustering objective in order to achieve absent multiple kernel clustering. The three stages in this process are carried out simultaneously until the convergence condition is met. Experiments on six datasets with various characteristics demonstrate that the kernel imputation and clustering performance of the proposed method is significantly better than state-of-the-art competitors. Meanwhile, the proposed method gains fast convergence speed.
dc.language	en
dc.publisher	Computers, Materials and Continua (Tech Science Press)
dc.relation.ispartof	Computers, Materials and Continua
dc.relation.isbasedon	10.32604/cmc.2021.013488
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	0103 Numerical and Computational Mathematics, 0912 Materials Engineering, 0915 Interdisciplinary Engineering
dc.subject.classification	Applied Mathematics
dc.title	A fast and effective multiple kernel clustering method on incomplete data
dc.type	Journal Article
utslib.citation.volume	67
utslib.for	0103 Numerical and Computational Mathematics
utslib.for	0912 Materials Engineering
utslib.for	0915 Interdisciplinary Engineering
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
utslib.copyright.status	open_access	*
dc.date.updated	2022-07-02T21:28:03Z
pubs.issue	1
pubs.publication-status	Published
pubs.volume	67
utslib.citation.issue	1

Abstract:

Multiple kernel clustering is an unsupervised data analysis method that has been used in various scenarios where data is easy to be collected but hard to be labeled. However, multiple kernel clustering for incomplete data is a critical yet challenging task. Although the existing absent multiple kernel clustering methods have achieved remarkable performance on this task, they may fail when data has a high value-missing rate, and they may easily fall into a local optimum. To address these problems, in this paper, we propose an absent multiple kernel clustering (AMKC) method on incomplete data. The AMKC method first clusters the initialized incomplete data. Then, it constructs a new multiple-kernel-based data space, referred to as K-space, from multiple sources to learn kernel combination coefficients. Finally, it seamlessly integrates an incomplete-kernel-imputation objective, a multiple-kernel-learning objective, and a kernel-clustering objective in order to achieve absent multiple kernel clustering. The three stages in this process are carried out simultaneously until the convergence condition is met. Experiments on six datasets with various characteristics demonstrate that the kernel imputation and clustering performance of the proposed method is significantly better than state-of-the-art competitors. Meanwhile, the proposed method gains fast convergence speed.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/158533