GBKII: An imputation method for missing values

Publication Type:
Conference Proceeding
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2007, 4426 LNAI pp. 1080 - 1087
Issue Date:
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2007000696.pdf488.67 kB
Adobe PDF
Missing data imputation is an actual and challenging issue in machine learning and data mining. This is because missing values in a dataset can generate bias that affects the quality of the learned patterns or the classification performances. To deal with this issue, this paper proposes a Grey-Based K-NN Iteration Imputation method, called GBKII, for imputing missing values. GBKII is an instance-based imputation method, which is referred to a non-parametric regression method in statistics. It is also efficient for handling with categorical attributes. We experimentally evaluate our approach and demonstrate that GBKII is much more efficient than the k-NN and mean-substitution methods. © Springer-Verlag Berlin Heidelberg 2007.
Please use this identifier to cite or link to this item: