Mining sequential patterns across time sequences

Publication Type:
Journal Article
Citation:
New Generation Computing, 2007, 26 (1), pp. 75 - 96
Issue Date:
2007-11-01
Abstract:
In this paper, we address the mining of sequential patterns in multiple time sequences. Building on PrefixSpan, a state-of-the-art sequential pattern mining algorithm for transaction databases, we propose MILE (MIning in muLtiple sEquences), an efficient algorithm for this task. MILE recursively uses knowledge of existing patterns to avoid redundant data scanning, which speeds up the discovery of new patterns. Another distinguishing feature of MILE is that it can incorporate prior knowledge of the data distribution in the time sequences into the mining process to further improve performance. Extensive empirical results show that MILE is significantly faster than PrefixSpan. Because MILE consumes more memory than PrefixSpan, we also present a solution that trades time efficiency for reduced memory use in memory-constrained environments. © Ohmsha, Ltd. and Springer 2008.
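For readers unfamiliar with the baseline named in the abstract, the following is a minimal sketch of PrefixSpan-style prefix projection. It is not MILE and is not taken from the paper. It assumes each sequence is a list of single items (no itemsets, no timestamps), and the function name `prefixspan` and the parameter `min_support` are hypothetical names chosen for illustration.

```python
from collections import defaultdict


def prefixspan(sequences, min_support):
    """Return {pattern_tuple: support} for every frequent sequential pattern.

    Simplified illustration: each sequence is a list of single items.
    """
    results = {}

    def mine(prefix, projected):
        # `projected` holds (sequence_index, start_offset) pairs. Each pair
        # denotes the suffix that follows the first match of `prefix`.
        first_pos = defaultdict(dict)
        for seq_id, start in projected:
            seq = sequences[seq_id]
            for pos in range(start, len(seq)):
                item = seq[pos]
                # Count an item once per sequence; keep its earliest position.
                if seq_id not in first_pos[item]:
                    first_pos[item][seq_id] = pos

        for item, occurrences in sorted(first_pos.items()):
            support = len(occurrences)
            if support < min_support:
                continue
            pattern = prefix + (item,)
            results[pattern] = support
            # Recurse only on the suffixes after this item (the projection),
            # so the full database is not rescanned for longer patterns.
            mine(pattern, [(sid, pos + 1) for sid, pos in occurrences.items()])

    mine((), [(i, 0) for i in range(len(sequences))])
    return results


# Example: three short sequences, patterns must occur in at least two of them.
example = [["a", "b", "c", "b"], ["a", "c", "b"], ["a", "b"]]
patterns = prefixspan(example, min_support=2)
# patterns[("a", "c", "b")] is 2: the pattern occurs in the first two sequences.
```

The recursion on projected suffixes is the part that a method such as MILE would build on; how MILE reuses existing patterns across multiple time sequences is described in the paper itself and is not reproduced here.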