A Deep-Ensemble-Level-Based Interpretable Takagi-Sugeno-Kang Fuzzy Classifier for Imbalanced Data.

Wang, G; Zhou, T; Choi, K-S; Lu, J

Publisher: Institute of Electrical and Electronics Engineers (IEEE)
Publication Type: Journal Article
Citation: IEEE transactions on cybernetics, 2020, PP, pp. 1-14
Issue Date: 2020-09-18

Closed Access

	Filename	Description	Size	Format
	09199810.pdf	Published version	2.55 MB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wang, G
dc.contributor.author	Zhou, T
dc.contributor.author	Choi, K-S
dc.contributor.author	Lu, J https://orcid.org/0000-0003-0690-4732
dc.date.accessioned	2020-10-15T10:34:31Z
dc.date.available	2020-10-15T10:34:31Z
dc.date.issued	2020-09-18
dc.identifier.citation	IEEE transactions on cybernetics, 2020, PP, pp. 1-14
dc.identifier.issn	2168-2267
dc.identifier.issn	2168-2275
dc.identifier.uri	http://hdl.handle.net/10453/143278
dc.description.abstract	Existing research reveals that the misclassification rate for imbalanced data depends heavily on the problematic areas that arise from small disjuncts, class overlap, and borderline and rare data samples. In this study, by stacking zero-order Takagi-Sugeno-Kang (TSK) fuzzy subclassifiers on the minority class and its problematic areas in a deep ensemble, a novel deep-ensemble-level-based TSK fuzzy classifier (IDE-TSK-FC) for imbalanced data classification tasks is presented to achieve both promising classification performance and the high interpretability of zero-order TSK fuzzy classifiers. At the same time, following the stacked generalization principle, the proposed classifier lifts oversampling from the data level to the deep ensemble level with a guarantee of enhanced generalization capability for class imbalance learning. In the structure of IDE-TSK-FC, the first interpretable zero-order TSK fuzzy subclassifier is built on the original training dataset. After that, several successive zero-order TSK fuzzy subclassifiers are stacked layer by layer on the newly identified problematic areas of the original training dataset plus the corresponding interpretable predictions obtained by averaging over all previous layers. At each layer, IDE-TSK-FC simply uses the classical K-nearest neighbor algorithm to identify the problematic area, which consists of the minority samples and their K surrounding majority neighbors. After randomly neglecting certain input features and randomly selecting from five Gaussian membership functions for all the chosen input features and the augmented feature in the premise of each fuzzy rule, each subclassifier can be quickly obtained by using the least learning machine to determine the consequent part of each fuzzy rule. The experimental results on both public datasets and a real-world healthcare dataset demonstrate IDE-TSK-FC's superiority in class-imbalanced learning.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	Institute of Electrical and Electronics Engineers (IEEE)
dc.relation	http://purl.org/au-research/grants/arc/DP190101733
dc.relation.ispartof	IEEE transactions on cybernetics
dc.relation.isbasedon	10.1109/tcyb.2020.3016972
dc.rights	© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	en_US
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.subject	0102 Applied Mathematics, 0801 Artificial Intelligence and Image Processing, 0906 Electrical and Electronic Engineering
dc.subject.classification	Artificial Intelligence & Image Processing
dc.title	A Deep-Ensemble-Level-Based Interpretable Takagi-Sugeno-Kang Fuzzy Classifier for Imbalanced Data.
dc.type	Journal Article
utslib.citation.volume	PP
utslib.location.activity	United States
utslib.for	0102 Applied Mathematics
utslib.for	0801 Artificial Intelligence and Image Processing
utslib.for	0906 Electrical and Electronic Engineering
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAII - Australian Artificial Intelligence Institute
pubs.organisational-group	/University of Technology Sydney
utslib.copyright.status	closed_access	*
dc.date.updated	2020-10-15T10:34:19Z
pubs.publication-status	Published
pubs.volume	PP
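
Two of the steps named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the Euclidean distance, the per-minority-sample reading of "K surrounding majority neighbors", and the rule representation are all assumptions made for illustration. The first function collects a problematic area from the minority samples and their nearest majority neighbors; the second evaluates a zero-order TSK fuzzy system whose rules use Gaussian membership functions on a subset of features and constant consequents.

```python
import math


def problematic_area(X, y, k, minority_label=1):
    """Return sorted indices of every minority sample plus, for each minority
    sample, its k nearest majority-class neighbors.

    Assumption: Euclidean distance and a per-sample neighbor search; the
    abstract does not specify either detail.
    """
    minority = [i for i, label in enumerate(y) if label == minority_label]
    majority = [i for i, label in enumerate(y) if label != minority_label]
    selected = set(minority)
    for i in minority:
        nearest = sorted(majority, key=lambda j: math.dist(X[i], X[j]))[:k]
        selected.update(nearest)
    return sorted(selected)


def gaussian(x, center, width):
    """Gaussian membership degree of x for a fuzzy set (center, width)."""
    return math.exp(-((x - center) ** 2) / (2.0 * width ** 2))


def zero_order_tsk_output(x, rules):
    """Evaluate a zero-order TSK fuzzy system at input x.

    Each rule is a pair (premise, consequent). The premise maps a feature
    index to (center, width); features missing from the premise are the
    ones "neglected" by that rule. The consequent is a constant.
    The output is the firing-strength-weighted average of the consequents.
    """
    numerator = 0.0
    denominator = 0.0
    for premise, consequent in rules:
        strength = 1.0
        for feature, (center, width) in premise.items():
            strength *= gaussian(x[feature], center, width)
        numerator += strength * consequent
        denominator += strength
    # Guard against all firing strengths underflowing to zero.
    return numerator / denominator if denominator > 0.0 else 0.0
```

In the stacked structure the abstract describes, each later subclassifier would be trained on the samples returned by `problematic_area`, with the averaged predictions of the earlier layers appended as an extra input feature. The consequents, which the paper determines with the least learning machine, are simply given as constants in this sketch.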

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/143278