M-DFA (multithreaded DFA): An algorithm for reduction of state transitions and acceleration of REGEXP matching

Cheng Hung Lin*, Jyh Charn Liu

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

This paper proposes a multi-thread based regular expression (regexp) matching algorithm, M-DFA (multithreaded DFA), for parallel computer architectures such as multi-core processors and graphic processing units (GPU). At the thread level, one thread is designated to traverse the DFA of a possible matching path until its termination, and at the task level multiple threads concurrently match each input symbol in parallel. Given a set of regexps, the total number of (DFA) state transitions in M-DFA is significantly smaller than that of its traditional DFA counterpart. The significant saving of state transitions is contributed by elimination of backtracking transitions, which commonly occur to mapping of concurrent active states in NFA to DFA and other situations. Experimental result shows that the proposed algorithm achieves significant reduction on state and state transition. In addition, the proposed algorithm running on Nvidia® GTX 480 is 35 times faster than the popular regexp library, RE2 performed on Intel Core i7 CPU.

Original languageEnglish
Title of host publicationANCS 2012 - Proceedings of the 8th ACM/IEEE Symposium on Architectures for Networking and Communications Systems
Pages79-80
Number of pages2
DOIs
Publication statusPublished - 2012
Event8th ACM/IEEE Symposium on Architectures for Networking and Communications Systems, ANCS 2012 - Austin, TX, United States
Duration: 2012 Oct 292012 Oct 30

Publication series

NameANCS 2012 - Proceedings of the 8th ACM/IEEE Symposium on Architectures for Networking and Communications Systems

Other

Other8th ACM/IEEE Symposium on Architectures for Networking and Communications Systems, ANCS 2012
Country/TerritoryUnited States
CityAustin, TX
Period2012/10/292012/10/30

Keywords

  • graphics processing units
  • regular expression matching

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'M-DFA (multithreaded DFA): An algorithm for reduction of state transitions and acceleration of REGEXP matching'. Together they form a unique fingerprint.

Cite this