Cost-sensitive feature selection by optimizing F-measures

Publication Type:
Journal Article
Citation:
IEEE Transactions on Image Processing, 2018, 27 (3), pp. 1323 - 1335
Issue Date:
2018-03-01
Filename:
08170306.pdf (Published Version, Adobe PDF, 3.21 MB)
© 2017 IEEE. Feature selection improves the performance of general machine learning tasks by extracting an informative subset from high-dimensional features. Conventional feature selection methods usually ignore the class imbalance problem, so the selected features are biased towards the majority class. Considering that the F-measure is a more reasonable performance measure than accuracy for imbalanced data, this paper presents an effective feature selection algorithm that addresses the class imbalance issue by optimizing F-measures. Since F-measure optimization can be decomposed into a series of cost-sensitive classification problems, we investigate cost-sensitive feature selection by generating and assigning different costs to each class, with rigorous theoretical guidance. After solving a series of cost-sensitive feature selection problems, the features corresponding to the best F-measure are selected. In this way, the selected features fully represent the properties of all classes. Experimental results on popular benchmarks and challenging real-world data sets demonstrate the significance of cost-sensitive feature selection for the imbalanced data setting and validate the effectiveness of the proposed method.
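The sketch below illustrates the cost-sweep idea described in the abstract, not the authors' actual algorithm: it assumes an L1-penalized, class-weighted logistic regression as a stand-in cost-sensitive feature selector, sweeps over hypothetical misclassification-cost ratios for the minority class, and keeps the feature subset that yields the best F-measure on held-out data. All names, data, and cost values are illustrative assumptions.

```python
# Minimal sketch of cost-sensitive feature selection driven by F-measure,
# assuming an L1-penalized, class-weighted logistic regression as the selector
# (a proxy, not the method proposed in the paper).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split

# Imbalanced synthetic data: ~10% positives, 50 features, few informative.
X, y = make_classification(n_samples=2000, n_features=50, n_informative=8,
                           weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, stratify=y, random_state=0)

best_f1, best_features = -1.0, None
for pos_cost in [1.0, 2.0, 5.0, 10.0, 20.0]:  # candidate costs for the minority class
    # Cost-sensitive selector: larger pos_cost penalizes minority-class errors more.
    selector = LogisticRegression(penalty="l1", solver="liblinear", C=0.1,
                                  class_weight={0: 1.0, 1: pos_cost})
    selector.fit(X_tr, y_tr)
    features = np.flatnonzero(selector.coef_.ravel())  # features kept under this cost
    if features.size == 0:
        continue
    # Evaluate the selected subset by the F-measure of a classifier trained on it.
    clf = LogisticRegression(class_weight={0: 1.0, 1: pos_cost}, max_iter=1000)
    clf.fit(X_tr[:, features], y_tr)
    f1 = f1_score(y_te, clf.predict(X_te[:, features]))
    if f1 > best_f1:
        best_f1, best_features = f1, features

print(f"best F1 = {best_f1:.3f} with {best_features.size} selected features")
```

Each cost setting yields its own feature subset; picking the subset whose classifier attains the highest F-measure mirrors, at a high level, the decomposition of F-measure optimization into a series of cost-sensitive problems described above.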