Balanced Random Hyperboxes for Class Imbalanced Problems

Khuat, TT; Le, MH

Balanced Random Hyperboxes for Class Imbalanced Problems

Khuat, TT Le, MH

Permalink

Publisher:: IAENG - International Association of Engineers
Publication Type:: Journal Article
Citation:: IAENG International Journal of Computer Science, 2021, 48, (2), pp. 406-412
Issue Date:: 2021-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published versionAdobe PDF (3.49 MB)

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Khuat, TT
dc.contributor.author	Le, MH
dc.date.accessioned	2022-03-19T23:10:34Z
dc.date.available	2022-03-19T23:10:34Z
dc.date.issued	2021-01-01
dc.identifier.citation	IAENG International Journal of Computer Science, 2021, 48, (2), pp. 406-412
dc.identifier.issn	1819-656X
dc.identifier.issn	1819-9224
dc.identifier.uri	http://hdl.handle.net/10453/155376
dc.description.abstract	A Random Hyperboxes (RH) classifier is a simple but powerful randomization-based ensemble model, including hyperbox-based classifiers used as base learners. Individual learners in this ensemble model are trained on random subspaces of both instance and feature spaces. This facet results in a flexible mechanism to form a high-performing classifier competitive with other ensemble models in the literature. Like other machine learning models, however, the RH classifier also faces inefficiency when dealing with class-imbalanced datasets. Meanwhile, data containing highly imbalanced class distributions are prevalent in practical applications. Hence, this paper proposes a new variant of the original RH model, namely Balance Random Hyperboxes (BRH), to bypass this drawback effectively. The proposed method uses an undersampling strategy to build individual learners instead of the random sampling method employed in the original RH model. The experiment conducted on software fault datasets, which show a highly class-imbalanced property, indicated the proposed method's efficiency compared to the original RH model and other ensemble models.
dc.language	en
dc.publisher	IAENG - International Association of Engineers
dc.relation.ispartof	IAENG International Journal of Computer Science
dc.rights	info:eu-repo/semantics/openAccess
dc.subject	08 Information and Computing Sciences
dc.title	Balanced Random Hyperboxes for Class Imbalanced Problems
dc.type	Journal Article
utslib.citation.volume	48
utslib.for	08 Information and Computing Sciences
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
utslib.copyright.status	open_access	*
pubs.consider-herdc	false
dc.date.updated	2022-03-19T23:10:32Z
pubs.issue	2
pubs.publication-status	Published
pubs.volume	48
utslib.citation.issue	2

Abstract:

A Random Hyperboxes (RH) classifier is a simple but powerful randomization-based ensemble model, including hyperbox-based classifiers used as base learners. Individual learners in this ensemble model are trained on random subspaces of both instance and feature spaces. This facet results in a flexible mechanism to form a high-performing classifier competitive with other ensemble models in the literature. Like other machine learning models, however, the RH classifier also faces inefficiency when dealing with class-imbalanced datasets. Meanwhile, data containing highly imbalanced class distributions are prevalent in practical applications. Hence, this paper proposes a new variant of the original RH model, namely Balance Random Hyperboxes (BRH), to bypass this drawback effectively. The proposed method uses an undersampling strategy to build individual learners instead of the random sampling method employed in the original RH model. The experiment conducted on software fault datasets, which show a highly class-imbalanced property, indicated the proposed method's efficiency compared to the original RH model and other ensemble models.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/155376