Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese

Berlin Chen*, Hsin Min Wang, Lin Shan Lee

*此作品的通信作者

研究成果: 雜誌貢獻期刊論文同行評審

53 引文 斯高帕斯(Scopus)

摘要

With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and more important. In this paper, considering the monosyllabic structure of the Chinese language, a whole class of syllable-based indexing features, including overlapping segments of syllables and Syllable pairs separated by a few syllables, is extensively investigated based on a Mandarin broadcast news database. The strong discriminating capabilities of such syllable-based features were verified by comparing with the word- or character-based features. Good approaches for better utilizing such capabilities, including fusion with the word- and character-level information and improved approaches to obtain better syllable-based features and query expressions, were extensively investigated. Very encouraging experimental results were obtained.

原文英語
頁(從 - 到)303-314
頁數12
期刊IEEE Transactions on Speech and Audio Processing
10
發行號5
DOIs
出版狀態已發佈 - 2002 7月
對外發佈

ASJC Scopus subject areas

  • 軟體
  • 聲學與超音波
  • 電腦視覺和模式識別
  • 電氣與電子工程

指紋

深入研究「Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese」主題。共同形成了獨特的指紋。

引用此