GBKII: An Imputation Method for Missing Values

Publication Type:
Conference Proceeding
Advances in Knowledge Discovery and Data Mining, 2007, pp. 1080 - 1087
Missing data imputation is a current and challenging issue in machine learning and data mining, because missing values in a dataset can introduce bias that degrades the quality of the learned patterns or the classification performance. To deal with this issue, this paper proposes a Grey-Based K-NN Iteration Imputation method, called GBKII, for imputing missing values. GBKII is an instance-based imputation method, known in statistics as a non-parametric regression method. It also handles categorical attributes efficiently. We experimentally evaluate our approach and demonstrate that GBKII is much more efficient than the k-NN and mean-substitution methods.
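The abstract names the ingredients of the method (grey-based similarity, k nearest neighbours, iteration) but not its details, so the following is only a minimal sketch of how such a scheme could look, not the authors' algorithm. It assumes numeric attributes scaled to [0, 1], at least two columns, at least one observed value per column, the standard grey relational grade with distinguishing coefficient 0.5, and mean substitution as the starting point. The names grey_relational_grades and impute, and the parameters k and iterations, are hypothetical. Categorical attributes, which the paper also covers, are not handled here.

```python
from statistics import mean

ZETA = 0.5  # distinguishing coefficient; 0.5 is a common choice (assumption)


def grey_relational_grades(target, candidates, zeta=ZETA):
    """Grey relational grade of each candidate row with respect to target.

    A higher grade means the candidate is more similar to the target.
    """
    deltas = [[abs(t - c) for t, c in zip(target, row)] for row in candidates]
    flat = [d for row in deltas for d in row]
    d_min, d_max = min(flat), max(flat)
    if d_max == 0:  # every candidate equals the target on all attributes
        return [1.0] * len(candidates)
    return [
        mean((d_min + zeta * d_max) / (d + zeta * d_max) for d in row)
        for row in deltas
    ]


def impute(rows, k=2, iterations=3):
    """Fill None entries using k nearest neighbours ranked by grey grade.

    Hypothetical sketch: starts from column means, then re-estimates each
    missing cell from its k most similar rows for a fixed number of passes.
    """
    n_cols = len(rows[0])
    missing = [
        (i, j)
        for i, row in enumerate(rows)
        for j, v in enumerate(row)
        if v is None
    ]
    # Initial guess: mean of the observed values in each column.
    col_means = [
        mean(r[j] for r in rows if r[j] is not None) for j in range(n_cols)
    ]
    filled = [
        [col_means[j] if v is None else v for j, v in enumerate(row)]
        for row in rows
    ]
    for _ in range(iterations):
        updated = [row[:] for row in filled]
        for i, j in missing:
            others = [r for idx, r in enumerate(filled) if idx != i]
            # Compare rows on every attribute except the one being imputed.
            target = filled[i][:j] + filled[i][j + 1:]
            stripped = [r[:j] + r[j + 1:] for r in others]
            grades = grey_relational_grades(target, stripped)
            ranked = sorted(zip(grades, others), key=lambda p: p[0], reverse=True)
            updated[i][j] = mean(r[j] for _, r in ranked[:k])
        filled = updated
    return filled


data = [
    [0.1, 0.2, 0.1],
    [0.2, 0.1, 0.2],
    [0.9, 0.8, None],
    [0.8, 0.9, 0.9],
    [0.9, 0.9, 0.8],
]
result = impute(data, k=2)
```

In this toy dataset the incomplete row is closest to the last two rows, so the missing cell is filled with roughly 0.85, whereas mean substitution alone would give 0.5. Ranking neighbours by grey relational grade instead of Euclidean distance is the only difference from an ordinary iterative k-NN imputer in this sketch.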