Association Link Network Based Core Events Discovery on the Web

IEEE Computer Society
Publication Type:
Liu, Yang et al. 2013, 'Association Link Network Based Core Events Discovery on the Web', IEEE Computer Society, United States, pp. 553-560.
Issue Date:
Filename Description Size
Thumbnail2013004411OK.pdf1.08 MB
Adobe PDF
Full metadata record
As documents are explosively increasing in the era of big data, document clustering has been proven to be useful for organizing online document streams into events. However, extant studies on document clustering still suffer from the problems of high dimensionality, scalability and accuracy. In this paper, we will present a novel association link network (ALN) based document clustering method, which is an adaptive iteration splitting process to discover core events on the web. In the iteration, we first detect community structures from ALN; then, map documents to the associated community based on words relations in ALN; finally rebuild communities using the mapped documents. Compared to existing document clustering methods, the effectiveness of presented clustering method in automatically discovering the web events is proved by the experimental results on real data set.
Please use this identifier to cite or link to this item: