SVDD-based outlier detection on uncertain data

Publication Type:
Journal Article
Citation:
Knowledge and Information Systems, 2013, 34 (3), pp. 597 - 618
Issue Date:
2013-01-01
Filename Description Size
Thumbnail2013001903OK.pdf632.75 kB
Adobe PDF
Full metadata record
Outlier detection is an important problem that has been studied within diverse research areas and application domains. Most existing methods are based on the assumption that an example can be exactly categorized as either a normal class or an outlier. However, in many real-life applications, data are uncertain in nature due to various errors or partial completeness. These data uncertainty make the detection of outliers far more difficult than it is from clearly separable data. The key challenge of handling uncertain data in outlier detection is how to reduce the impact of uncertain data on the learned distinctive classifier. This paper proposes a new SVDD-based approach to detect outliers on uncertain data. The proposed approach operates in two steps. In the first step, a pseudo-training set is generated by assigning a confidence score to each input example, which indicates the likelihood of an example tending normal class. In the second step, the generated confidence score is incorporated into the support vector data description training phase to construct a global distinctive classifier for outlier detection. In this phase, the contribution of the examples with the least confidence score on the construction of the decision boundary has been reduced. The experiments show that the proposed approach outperforms state-of-art outlier detection techniques. © 2012 Springer-Verlag London Limited.
Please use this identifier to cite or link to this item: