An efficient approach for mining top-K fault-tolerant repeating patterns

Jia Ling Koh*, Yu Ting Kung

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

In this paper, an efficient strategy for mining top-K non-trivial faulttolerant repeating patterns (FT-RPs in short) with lengths no less than min_len from data sequences is provided. By extending the idea of appearing bit sequences, fault-tolerant appearing bit sequences are defined to represent the locations where candidate patterns appear in a data sequence with insertion/deletion errors being allowed. Two algorithms, named TFTRP-Mine(Top-K non-trivial FT-RPs Mining) and RE-TFTRP-Mine (REfinement of TFTRP-Mine), respectively, are proposed. Both of these two algorithms use the recursive formulas to obtain the fault-tolerant appearing bit sequence of a pattern systematically and then the fault-tolerant frequency of each candidate pattern could be counted quickly. Besides, RE-TFTRP-Mine adopts two additional strategies for pruning the searching space in order to improve the mining efficiency. The experimental results show that RE-TFTRP-Mine outperforms TFTRP-Mine algorithm when K and min_len are small. In addition, more important and implicit repeating patterns could be found from real music objects by adopting fault tolerant mining.

Original languageEnglish
Title of host publicationDatabase Systems for Advanced Applications - 11th International Conference, DASFAA 2006, Proceedings
Pages95-110
Number of pages16
DOIs
Publication statusPublished - 2006
Event11th International Conference on Database Systems for Advanced Applications, DASFAA 2006 - Singapore, Singapore
Duration: 2006 Apr 122006 Apr 15

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume3882 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other11th International Conference on Database Systems for Advanced Applications, DASFAA 2006
Country/TerritorySingapore
CitySingapore
Period2006/04/122006/04/15

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'An efficient approach for mining top-K fault-tolerant repeating patterns'. Together they form a unique fingerprint.

Cite this