Class noise handling for effective cost-sensitive learning by cost-guided iterative classification filtering

Publication Type:
Journal Article
Citation:
IEEE Transactions on Knowledge and Data Engineering, 2006, 18 (10), pp. 1435 - 1440
Issue Date:
2006-10-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2011000606OK.pdf2.32 MB
Adobe PDF
Recent research in machine learning, data mining, and related areas has produced a wide variety of algorithms for cost-sensitive (CS) classification, where instead of maximizing the classification accuracy, minimizing the misclassification cost becomes the objective. These methods often assume that their input is quality data without conflict or erroneous values, or the noise impact is trivial, which is seldom the case in real-world environments. In this paper, we propose a Cost-guided Iterative Classification Filter (CICF) to identify noise for effective CS learning. Instead of putting equal weights on handling noise in all classes in existing efforts, CICF puts more emphasis on expensive classes, which makes it attractive in dealing with data sets with a large cost-ratio. (Experimental results and comparative studies indicate that the existence of noise may seriously corrupt the performance of the underlying CS learners and by adopting the proposed CICF algorithm, we can significantly reduce the misclassification cost of a CS classifier in noisy environments. © 2006 IEEE.
Please use this identifier to cite or link to this item: