Incrementally mining recently repeating patterns over data streams

Jia Ling Koh, Pei Min Chou

研究成果: 書貢獻/報告類型會議論文篇章

摘要

Repeating patterns represent temporal relations among data items, which could be used for data summarization and data prediction. More and more data of various applications is generated as a data stream. Based on time sensitive concern, mining repeating patterns from the whole history data sequence of a data stream does not extract the current trend of patterns over the stream. Therefore, the traditional strategies for mining repeating patterns on static database are not applicable to data streams. For this reason, an algorithm, named appearing-bit-sequence-based incremental mining algorithm, for efficiently discovering recently repeating patterns over a data stream is proposed in this paper. The appearing bit sequences are used to monitor the occurrences of patterns within a sliding window. Two versions of algorithms are proposed by maintaining the appearing bit sequences of maximum repeating patterns and closed repeating patterns, respectively. Accordingly, the cost of re-mining repeating patterns over a sliding window is reduced to that of monitoring frequency changes of the maintained patterns. The experimental results show that the incremental mining methods perform much better than the re-miming approach.

原文英語
主出版物標題New Frontiers in Applied Data Mining - PAKDD 2008 International Workshops, Revised Selected Papers
發行者Springer Verlag
頁面26-37
頁數12
ISBN(列印)3642003982, 9783642003981
DOIs
出版狀態已發佈 - 2009
事件Pacific Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2008 - Osaka, 日本
持續時間: 2008 5月 202009 5月 23

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
5433 LNAI
ISSN(列印)0302-9743
ISSN(電子)1611-3349

其他

其他Pacific Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2008
國家/地區日本
城市Osaka
期間2008/05/202009/05/23

ASJC Scopus subject areas

  • 理論電腦科學
  • 一般電腦科學

指紋

深入研究「Incrementally mining recently repeating patterns over data streams」主題。共同形成了獨特的指紋。

引用此