Comparison of word and subword indexing techniques for mandarin Chinese spoken document retrieval

Hsin Min Wang, Berlin Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

In this paper, we investigate the use of words and subwords (including both characters and syllables) in audio indexing for Mandarin Chinese spoken document retrieval. Two retrieval approaches, including the well-known vector space model approach and the newly proposed HMM/N-gram-based approach, are used in the present work. We focus on the use of an entire Chinese textual story (from a newspaper) as a query to retrieve Mandarin Chinese spoken documents (from news broadcasts). Experiments are based on the Topic Detection and Tracking Corpora.

原文英語
主出版物標題Advances in Multimedia Information Processing - PCM 2001 - 2nd IEEE Pacific Rim Conference on Multimedia, Proceedings
編輯Heung-Yeung Shum, Mark Liao, Shih-Fu Chang
發行者Springer Verlag
頁面606-613
頁數8
ISBN(列印)3540426809, 9783540426806
DOIs
出版狀態已發佈 - 2001
對外發佈
事件2nd IEEE Pacific-Rim Conference on Multimedia, IEEE-PCM 2001 - Beijing, 中国
持續時間: 2001 10月 242001 10月 26

出版系列

名字Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
2195
ISSN(列印)0302-9743
ISSN(電子)1611-3349

其他

其他2nd IEEE Pacific-Rim Conference on Multimedia, IEEE-PCM 2001
國家/地區中国
城市Beijing
期間2001/10/242001/10/26

ASJC Scopus subject areas

  • 理論電腦科學
  • 一般電腦科學

指紋

深入研究「Comparison of word and subword indexing techniques for mandarin Chinese spoken document retrieval」主題。共同形成了獨特的指紋。

引用此