Training deep neural networks on imbalanced data sets

Wang, S; Liu, W; Wu, J; Cao, L; Meng, Q; Kennedy, PJ

Training deep neural networks on imbalanced data sets

Wang, S

Liu, W

Wu, J

Cao, L

Meng, Q Kennedy, PJ

Permalink

Publication Type:: Conference Proceeding
Citation:: Proceedings of the International Joint Conference on Neural Networks, 2016, 2016-October pp. 4368 - 4374
Issue Date:: 2016-10-31

Closed Access

	Filename	Description	Size
	07727770.pdf	Published version	187.89 kB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Closed Access

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Wang, S https://orcid.org/0000-0003-1133-9379	en_US
dc.contributor.author	Liu, W https://orcid.org/0000-0002-3003-1313	en_US
dc.contributor.author	Wu, J https://orcid.org/0000-0002-1371-5801	en_US
dc.contributor.author	Cao, L https://orcid.org/0000-0003-1562-9429	en_US
dc.contributor.author	Meng, Q	en_US
dc.contributor.author	Kennedy, PJ https://orcid.org/0000-0001-7837-3171	en_US
dc.date.issued	2016-10-31	en_US
dc.identifier.citation	Proceedings of the International Joint Conference on Neural Networks, 2016, 2016-October pp. 4368 - 4374	en_US
dc.identifier.isbn	9781509006199	en_US
dc.identifier.uri	http://hdl.handle.net/10453/88092
dc.description.abstract	© 2016 IEEE. Deep learning has become increasingly popular in both academic and industrial areas in the past years. Various domains including pattern recognition, computer vision, and natural language processing have witnessed the great power of deep networks. However, current studies on deep learning mainly focus on data sets with balanced class labels, while its performance on imbalanced data is not well examined. Imbalanced data sets exist widely in real world and they have been providing great challenges for classification tasks. In this paper, we focus on the problem of classification using deep network on imbalanced data sets. Specifically, a novel loss function called mean false error together with its improved version mean squared false error are proposed for the training of deep networks on imbalanced data sets. The proposed method can effectively capture classification errors from both majority class and minority class equally. Experiments and comparisons demonstrate the superiority of the proposed approach compared with conventional methods in classifying imbalanced data sets on deep neural networks.	en_US
dc.relation.ispartof	Proceedings of the International Joint Conference on Neural Networks	en_US
dc.relation.isbasedon	10.1109/IJCNN.2016.7727770	en_US
dc.title	Training deep neural networks on imbalanced data sets	en_US
dc.type	Conference Proceeding
utslib.citation.volume	2016-October	en_US
utslib.for	080101 Adaptive Agents and Intelligent Robotics	en_US
utslib.for	080110 Simulation and Modelling	en_US
utslib.for	080109 Pattern Recognition and Data Mining	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Computer Science
pubs.organisational-group	/University of Technology Sydney/Strength - AAI - Advanced Analytics Institute Research Centre
pubs.organisational-group	/University of Technology Sydney/Strength - CAI - Centre for Artificial Intelligence
pubs.organisational-group	/University of Technology Sydney/Strength - CHT - Health Technologies
utslib.copyright.status	closed_access
pubs.publication-status	Published	en_US
pubs.volume	2016-October	en_US

Abstract:

© 2016 IEEE. Deep learning has become increasingly popular in both academic and industrial areas in the past years. Various domains including pattern recognition, computer vision, and natural language processing have witnessed the great power of deep networks. However, current studies on deep learning mainly focus on data sets with balanced class labels, while its performance on imbalanced data is not well examined. Imbalanced data sets exist widely in real world and they have been providing great challenges for classification tasks. In this paper, we focus on the problem of classification using deep network on imbalanced data sets. Specifically, a novel loss function called mean false error together with its improved version mean squared false error are proposed for the training of deep networks on imbalanced data sets. The proposed method can effectively capture classification errors from both majority class and minority class equally. Experiments and comparisons demonstrate the superiority of the proposed approach compared with conventional methods in classifying imbalanced data sets on deep neural networks.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/88092