TY - GEN
T1 - Approximately mining recently representative patterns on data streams
AU - Koh, Jia Ling
AU - Don, Yuan Bin
PY - 2007
Y1 - 2007
N2 - Catching the recent trend of data is an important issue when mining frequent itemsets from data streams. To prevent from storing the whole transaction data within the sliding window, the frequency changing point (FCP) method was proposed for monitoring the recent occurrences of itemsets in a data stream under the assumption that exact one transaction arrives at each time point. In this paper, the FCP method is extended for maintaining recent patterns in a data stream where a block of various numbers of transactions (including zero or more transactions) is inputted within each time unit. Moreover, to avoid generating redundant information in the mining results, the recently representative patterns are discovered from the maintained structure approximately. The experimental results show that our approach reduces the run-time memory usage significantly. Moreover, the proposed GFCP algorithm achieves high accuracy of mining results and guarantees no false dismissal occurring.
AB - Catching the recent trend of data is an important issue when mining frequent itemsets from data streams. To prevent from storing the whole transaction data within the sliding window, the frequency changing point (FCP) method was proposed for monitoring the recent occurrences of itemsets in a data stream under the assumption that exact one transaction arrives at each time point. In this paper, the FCP method is extended for maintaining recent patterns in a data stream where a block of various numbers of transactions (including zero or more transactions) is inputted within each time unit. Moreover, to avoid generating redundant information in the mining results, the recently representative patterns are discovered from the maintained structure approximately. The experimental results show that our approach reduces the run-time memory usage significantly. Moreover, the proposed GFCP algorithm achieves high accuracy of mining results and guarantees no false dismissal occurring.
UR - http://www.scopus.com/inward/record.url?scp=38549123331&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38549123331&partnerID=8YFLogxK
U2 - 10.1007/978-3-540-77018-3_25
DO - 10.1007/978-3-540-77018-3_25
M3 - Conference contribution
AN - SCOPUS:38549123331
SN - 354077016X
SN - 9783540770169
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 231
EP - 243
BT - Emerging Technologies in Knowledge Discovery and Data Mining - PAKDD 2007 International Workshops, Revised Selected Papers
PB - Springer Verlag
T2 - Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2007
Y2 - 22 May 2007 through 22 May 2007
ER -