High-performance parallel location-aware algorithms for approximate string matching on GPUs

Cheng Hung Lin, Chun Cheng Huang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Approximate string matching is widely used in many applications, including deoxyribonucleic acid (DNA) sequence searching, spell checking, text mining, and spam filtering. It finds all locations in a text where a string approximately matches a pattern, with similarity measured by the number of insertion, deletion, and substitution operations. Among the proposed algorithms, the bit-parallel algorithms are considered the best and most efficient. However, traditional bit-parallel algorithms lack the ability to identify the start and end positions of a matched pattern. Furthermore, accelerating the bit-parallel algorithms has become a crucial issue for processing big data. In this paper, we propose two kinds of parallel location-aware algorithms, called data-segmented parallelism and high-degree parallelism, to accelerate approximate string matching using graphics processing units (GPUs). Experimental results show that the high-degree parallelism on GPUs achieves significant improvements in system and kernel throughput compared to its CPU counterparts. Compared to state-of-the-art approaches, the proposed high-degree parallelism achieves an 11- to 105-fold improvement.
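
To make the bit-parallel idea concrete, here is a minimal CPU-side sketch in Python of the classic row-wise simulation of the Levenshtein nondeterministic finite automaton, in the style of Wu and Manber's Shift-And with errors. It is not the location-aware GPU method proposed in the paper. The function names `build_masks` and `approx_match_ends` are hypothetical, and the sketch reports only the text index at which the accepting state is reached; it makes no attempt to recover where a match starts.

```python
def build_masks(pattern):
    """Map each character to a bitmask; bit j+1 is set when pattern[j] == char."""
    masks = {}
    for j, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << (j + 1))
    return masks


def approx_match_ends(text, pattern, k):
    """Return 0-based text indices where an occurrence of `pattern`
    with at most k insertions, deletions, or substitutions ends."""
    m = len(pattern)
    masks = build_masks(pattern)
    full = (1 << (m + 1)) - 1   # bits 0..m, one per automaton state
    accept = 1 << m             # state m: the whole pattern is matched

    # rows[i] has bit j set when pattern[:j] matches some suffix of the
    # text read so far with at most i errors. Before any text is read,
    # pattern[:j] is reachable with j deletions, so bits 0..i are set.
    rows = [((1 << (i + 1)) - 1) & full for i in range(k + 1)]

    ends = []
    for pos, ch in enumerate(text):
        b = masks.get(ch, 0)
        prev_old = rows[0]
        # Row 0 allows exact matches only; bit 0 (empty prefix) is always active.
        rows[0] = ((rows[0] << 1) & b) | 1
        prev_new = rows[0]
        for i in range(1, k + 1):
            cur_old = rows[i]
            rows[i] = (
                ((cur_old << 1) & b)   # match: advance within the same row
                | prev_old             # insertion: extra character in the text
                | (prev_old << 1)      # substitution: advance with one more error
                | (prev_new << 1)      # deletion: skip a pattern character
                | 1                    # a match may start at any position
            ) & full
            prev_old, prev_new = cur_old, rows[i]
        if rows[k] & accept:
            ends.append(pos)
    return ends
```

For example, `approx_match_ends("abd", "abc", 1)` returns `[1, 2]`: "ab" ends at index 1 (one deletion) and "abd" ends at index 2 (one substitution). Each text character costs a constant number of word-level operations per error row, which is the source of the efficiency the abstract refers to. Python integers are unbounded, so this sketch does not model the fixed machine-word width a GPU kernel would have to handle.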

Original language: English
Title of host publication: Proceedings - 2015 IEEE 21st International Conference on Parallel and Distributed Systems, ICPADS 2015
Publisher: IEEE Computer Society
Pages: 570-575
Number of pages: 6
ISBN (Electronic): 9780769557854
Publication status: Published - 2016 Jan 15
Event: 21st IEEE International Conference on Parallel and Distributed Systems, ICPADS 2015 - Melbourne, Australia
Duration: 2015 Dec 14 to 2015 Dec 17

Publication series

Name: Proceedings of the International Conference on Parallel and Distributed Systems - ICPADS
Volume: 2016-January
ISSN (Print): 1521-9097

Other

Other: 21st IEEE International Conference on Parallel and Distributed Systems, ICPADS 2015
Country/Territory: Australia
City: Melbourne
Period: 2015/12/14 to 2015/12/17

Keywords

  • Approximate string matching
  • Bit-parallel algorithm
  • Graphic processing units
  • Levenshtein distance
  • Nondeterministic finite automaton
  • Parallel algorithm

ASJC Scopus subject areas

  • Hardware and Architecture
