Minority Oversampling in Kernel Adaptive Subspaces for Class Imbalanced Datasets

Lin, CT; Hsieh, TY; Liu, YT; Lin, YY; Fang, CN; Wang, YK; Yen, G; Pal, NR; Chuang, CH

Publication Type: Journal Article
Citation: IEEE Transactions on Knowledge and Data Engineering, 2018, 30 (5), pp. 950-962
Issue Date: 2018-05-01

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Lin, CT https://orcid.org/0000-0001-8371-8197	en_US
dc.contributor.author	Hsieh, TY	en_US
dc.contributor.author	Liu, YT https://orcid.org/0000-0001-6184-2373	en_US
dc.contributor.author	Lin, YY	en_US
dc.contributor.author	Fang, CN https://orcid.org/0000-0002-7889-4564	en_US
dc.contributor.author	Wang, YK https://orcid.org/0000-0001-8390-2664	en_US
dc.contributor.author	Yen, G	en_US
dc.contributor.author	Pal, NR	en_US
dc.contributor.author	Chuang, CH https://orcid.org/0000-0002-5043-8380	en_US
dc.date.issued	2018-05-01	en_US
dc.identifier.citation	IEEE Transactions on Knowledge and Data Engineering, 2018, 30 (5), pp. 950 - 962	en_US
dc.identifier.issn	1041-4347	en_US
dc.identifier.uri	http://hdl.handle.net/10453/124045
dc.relation	http://purl.org/au-research/grants/arc/DP180100670
dc.relation	http://purl.org/au-research/grants/arc/DP180100656
dc.relation.ispartof	IEEE Transactions on Knowledge and Data Engineering	en_US
dc.relation.isbasedon	10.1109/TKDE.2017.2779849	en_US
dc.subject.classification	Information Systems	en_US
dc.title	Minority Oversampling in Kernel Adaptive Subspaces for Class Imbalanced Datasets	en_US
dc.type	Journal Article
utslib.citation.volume	5	en_US
utslib.citation.volume	30	en_US
utslib.for	0802 Computation Theory and Mathematics	en_US
utslib.for	1109 Neurosciences	en_US
utslib.for	08 Information and Computing Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Software
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
utslib.copyright.status	open_access
pubs.issue	5	en_US
pubs.publication-status	Published	en_US
pubs.volume	30	en_US

Abstract:

© 1989-2012 IEEE. The class imbalance problem in machine learning occurs when certain classes are underrepresented relative to the others, leading to a learning bias toward the majority classes. To cope with the skewed class distribution, many learning methods featuring minority oversampling have been proposed and have proved effective. To reduce information loss during feature space projection, this study proposes a novel oversampling algorithm, named minority oversampling in kernel adaptive subspaces (MOKAS), which exploits the invariant feature extraction capability of a kernel version of the adaptive subspace self-organizing maps. Synthetic instances are generated from well-trained subspaces, and their pre-images are then reconstructed in the input space. These instances characterize nonlinear structures present in the minority class data distribution and help learning algorithms counterbalance the skewed class distribution. Experimental results on both real and synthetic data show that the proposed MOKAS is capable of modeling complex data distributions and outperforms a set of state-of-the-art oversampling algorithms.
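
For readers unfamiliar with minority oversampling, the general idea can be illustrated with a minimal sketch. This is not MOKAS: it is a generic interpolation-based baseline (in the style of SMOTE) that works directly in the input space, with no kernel subspaces and no pre-image reconstruction. The function names, the toy data, and the parameters `k` and `seed` are assumptions made for illustration only.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def oversample_minority(minority, n_new, k=3, seed=0):
    """Generic interpolation-based oversampling (SMOTE-style), NOT MOKAS.

    Each synthetic point lies on the line segment between a randomly
    chosen minority instance and one of its k nearest minority neighbours.
    """
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority instances by distance to the base point.
        others = [j for j in range(len(minority)) if j != i]
        others.sort(key=lambda j: sq_dist(base, minority[j]))
        neighbour = minority[rng.choice(others[:k])]
        gap = rng.random()  # interpolation weight in [0, 1)
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical toy data: 4 minority instances against 10 majority instances.
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
n_majority = 10

# Generate enough synthetic instances to balance the two classes.
synthetic = oversample_minority(minority, n_majority - len(minority))
balanced_minority = minority + synthetic
```

Because every synthetic point is a convex combination of two existing minority instances, this baseline can only fill in straight-line segments between them. The abstract's claim is that generating instances in kernel adaptive subspaces captures nonlinear structure that such input-space interpolation misses.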

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/124045