Graph repairing under neighborhood constraints

Publication Type:
Journal Article
Citation:
VLDB Journal, 2017, 26 (5), pp. 611 - 635
Issue Date:
2017-10-01
Filename Description Size
Song2017_Article_GraphRepairingUnderNeighborhoo.pdfPublished Version1.73 MB
Adobe PDF
Full metadata record
© 2017, Springer-Verlag Berlin Heidelberg. A broad class of data, ranging from similarity networks, workflow networks to protein networks, can be modeled as graphs with data values as vertex labels. Both vertex labels and neighbors could be dirty for various reasons such as typos or erroneous reporting of results in scientific experiments. Neighborhood constraints, specifying label pairs that are allowed to appear on adjacent vertices in the graph, are employed to detect and repair erroneous vertex labels and neighbors. In this paper, we study the problem of repairing vertex labels and neighbors to make graphs satisfy neighborhood constraints. Unfortunately, the problem is generally hard, which motivates us to devise approximation methods for repairing and identify interesting special cases (star and clique constraints) that can be efficiently solved. First, we propose several label repairing approximation algorithms including greedy heuristics, contraction method and an approach combining both. The performances of algorithms are also analyzed for the special case. Moreover, we devise a cubic-time constant-factor graph repairing algorithm with both label and neighbor repairs (given degree-bounded instance graphs). Our extensive experimental evaluation on real data demonstrates the effectiveness of eliminating frauds in several types of application networks.
Please use this identifier to cite or link to this item: