A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding

Bor Shen Lin, Berlin Chen, Hsin Min Wang, Lin Shan Lee

Research output: Contribution to journalArticle

6 Citations (Scopus)

Abstract

It has always been difficult for language understanding systems to handle spontaneous speech with satisfactory robustness, primarily due to such problems as the fragments, disfluencies, out-of-vocabulary words, and ill-formed sentence structures. Also, the search schemes used are usually not flexible enough in accepting different input linguistic units, and great efforts are therefore required when they are used with different acoustic front ends in different tasks, specially in multi-modal and multi-lingual systems. In this paper, a new hierarchical tag-graph-based search scheme for spontaneous speech understanding is proposed. This scheme is based on a layered hierarchy of grammar rules, and therefore can integrate all the statistical and rule-based knowledge including acoustic scores, language model scores and grammar rules into the search process. More robust speech understanding is thus achievable. In addition, this scheme can accept graphs of different linguistic units such as phonemes, syllables, characters, words, spotted keywords, or phrases as the input, thus compatible to different acoustic front ends and multi-modal and multi-lingual applications can be easily developed. This search scheme has been successfully applied to a multi-domain, multi-modal dialogue system.

Original languageEnglish
Pages (from-to)819-831
Number of pages13
JournalPattern Recognition Letters
Volume23
Issue number7
DOIs
Publication statusPublished - 2002 May 1

Fingerprint

Acoustics
Linguistics

Keywords

  • Robustness
  • Speech understanding
  • Spontaneous speech
  • Tag-graph search

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Cite this

A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding. / Lin, Bor Shen; Chen, Berlin; Wang, Hsin Min; Lee, Lin Shan.

In: Pattern Recognition Letters, Vol. 23, No. 7, 01.05.2002, p. 819-831.

Research output: Contribution to journalArticle

@article{2063d20c30d54716b8739c316350f76f,
title = "A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding",
abstract = "It has always been difficult for language understanding systems to handle spontaneous speech with satisfactory robustness, primarily due to such problems as the fragments, disfluencies, out-of-vocabulary words, and ill-formed sentence structures. Also, the search schemes used are usually not flexible enough in accepting different input linguistic units, and great efforts are therefore required when they are used with different acoustic front ends in different tasks, specially in multi-modal and multi-lingual systems. In this paper, a new hierarchical tag-graph-based search scheme for spontaneous speech understanding is proposed. This scheme is based on a layered hierarchy of grammar rules, and therefore can integrate all the statistical and rule-based knowledge including acoustic scores, language model scores and grammar rules into the search process. More robust speech understanding is thus achievable. In addition, this scheme can accept graphs of different linguistic units such as phonemes, syllables, characters, words, spotted keywords, or phrases as the input, thus compatible to different acoustic front ends and multi-modal and multi-lingual applications can be easily developed. This search scheme has been successfully applied to a multi-domain, multi-modal dialogue system.",
keywords = "Robustness, Speech understanding, Spontaneous speech, Tag-graph search",
author = "Lin, {Bor Shen} and Berlin Chen and Wang, {Hsin Min} and Lee, {Lin Shan}",
year = "2002",
month = "5",
day = "1",
doi = "10.1016/S0167-8655(01)00158-1",
language = "English",
volume = "23",
pages = "819--831",
journal = "Pattern Recognition Letters",
issn = "0167-8655",
publisher = "Elsevier",
number = "7",

}

TY - JOUR

T1 - A hierarchical tag-graph search scheme with layered grammar rules for spontaneous speech understanding

AU - Lin, Bor Shen

AU - Chen, Berlin

AU - Wang, Hsin Min

AU - Lee, Lin Shan

PY - 2002/5/1

Y1 - 2002/5/1

N2 - It has always been difficult for language understanding systems to handle spontaneous speech with satisfactory robustness, primarily due to such problems as the fragments, disfluencies, out-of-vocabulary words, and ill-formed sentence structures. Also, the search schemes used are usually not flexible enough in accepting different input linguistic units, and great efforts are therefore required when they are used with different acoustic front ends in different tasks, specially in multi-modal and multi-lingual systems. In this paper, a new hierarchical tag-graph-based search scheme for spontaneous speech understanding is proposed. This scheme is based on a layered hierarchy of grammar rules, and therefore can integrate all the statistical and rule-based knowledge including acoustic scores, language model scores and grammar rules into the search process. More robust speech understanding is thus achievable. In addition, this scheme can accept graphs of different linguistic units such as phonemes, syllables, characters, words, spotted keywords, or phrases as the input, thus compatible to different acoustic front ends and multi-modal and multi-lingual applications can be easily developed. This search scheme has been successfully applied to a multi-domain, multi-modal dialogue system.

AB - It has always been difficult for language understanding systems to handle spontaneous speech with satisfactory robustness, primarily due to such problems as the fragments, disfluencies, out-of-vocabulary words, and ill-formed sentence structures. Also, the search schemes used are usually not flexible enough in accepting different input linguistic units, and great efforts are therefore required when they are used with different acoustic front ends in different tasks, specially in multi-modal and multi-lingual systems. In this paper, a new hierarchical tag-graph-based search scheme for spontaneous speech understanding is proposed. This scheme is based on a layered hierarchy of grammar rules, and therefore can integrate all the statistical and rule-based knowledge including acoustic scores, language model scores and grammar rules into the search process. More robust speech understanding is thus achievable. In addition, this scheme can accept graphs of different linguistic units such as phonemes, syllables, characters, words, spotted keywords, or phrases as the input, thus compatible to different acoustic front ends and multi-modal and multi-lingual applications can be easily developed. This search scheme has been successfully applied to a multi-domain, multi-modal dialogue system.

KW - Robustness

KW - Speech understanding

KW - Spontaneous speech

KW - Tag-graph search

UR - http://www.scopus.com/inward/record.url?scp=0036567728&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0036567728&partnerID=8YFLogxK

U2 - 10.1016/S0167-8655(01)00158-1

DO - 10.1016/S0167-8655(01)00158-1

M3 - Article

AN - SCOPUS:0036567728

VL - 23

SP - 819

EP - 831

JO - Pattern Recognition Letters

JF - Pattern Recognition Letters

SN - 0167-8655

IS - 7

ER -