Optimized parameters for missing data imputation

Publication Type:
Conference Proceeding
Citation:
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2006, 4099 LNAI pp. 1010 - 1016
Issue Date:
2006-01-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2006005463.pdf996.63 kB
Adobe PDF
To complete missing values, a solution is to use attribute correlations within data. However, it is difficult to identify such relations within data containing missing values. Accordingly, we develop a kernel-based missing data imputation method in this paper. This approach aims at making optimal statistical parameters: mean, distribution function after missing-data are imputed. We refer this approach to parameter optimization method (POP algorithm, a random regression imputation). We experimentally evaluate our approach, and demonstrate that our POP algorithm is much better than deterministic regression imputation in efficiency of generating an inference on the above two parameters. The results also show our algorithm is computationally efficient, robust and stable for the missing data imputation. © Springer-Verlag Berlin Heidelberg 2006.
Please use this identifier to cite or link to this item: