User Preference Analysis for Most Frequent Peer/Dominator

Publication Type:
Journal Article
IEEE Transactions on Knowledge and Data Engineering, 2019, 31 (7), pp. 1412 - 1425
Issue Date:
Filename Description Size
08413137.pdfAccepted Manuscript Version4.14 MB
Adobe PDF
Full metadata record
© 1989-2012 IEEE. Given a set of objects O (such as hotels), each can be represented as a point in a multi-dimensional feature space where each dimension corresponds to one attribute of the objects (such as price). Given the preference of a customer, the objects in O not dominated by any other object (i.e., beat in all dimensions) are those worthy to be further considered. Such objects are known as skyline objects in database community. Suppose we have an object o\in O-O. If o is a skyline point, other skyline objects are called peers of o. If o is not a skyline object, it must be dominated by some skyline objects which are called dominators of o. Given a large number of user preferences, an interesting problem is to identify the most frequent peer/dominator (MFP/MFD) of o. The MFP/MFD search has unique values in competitor analysis of various information systems. However, it is a challenging task because of the complexity to process a large number of user preferences. In this work, we provide robust solutions including exact and approximate methods. While the exact solutions explore the dominance relationship in the feature space, the approximate solutions are based on sampling techniques with theoretical bounds. We did extensive tests on large data sets which are up to 100 million user preferences generated from commercial surveys. The test resutls demonstrate the exact algorithms outperform various baseline algorithms significantly, and the approximate algorithms make further improvement by one order of magnitude with 90-98 percent accuracy.
Please use this identifier to cite or link to this item: