Error Detection and Uncertainty Modeling for Imprecise Data

Publisher:
IEEE Computer Society
Publication Type:
Conference Proceeding
Citation:
Proc. of the 21st IEEE International Conference on Tools with Artificial Intelligence (ICTAI-09), 2009, pp. 792 - 795
Issue Date:
2009-01
Full metadata record
Files in This Item:
Filename Description Size
Thumbnail2009001674OK.pdf929.45 kB
Adobe PDF
In this paper, we propose a method to derive and model data uncertainty from imprecise data. We view data imprecision and errors as the outcome of the precise data exposed to some uncertain channels, and our scheme is to directly derive the data uncertainty model from imprecise data, such that the derived data uncertainty information may be integrated into the succeeding mining process. To achieve the goal, we propose an Expectation Maximization (EM) based approach to detect erroneous data entries from the input data. The data uncertainty models are constructed by applying statistical analysis to the detected errors. Experimental results show that the proposed error detection approach can locate data errors and suggest alternative data entry values to improve classifiers built from imprecise data. In addition, the uncertain models derived for each individual attributes are shown to be close to the genuine uncertainty models used to corrupt the data.
Please use this identifier to cite or link to this item: