Topic modeling for spoken document retrieval using word- and syllable-level information

Shih Hsiang Lin*, Berlin Chen

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

3 引文 斯高帕斯(Scopus)

摘要

Topic modeling for information retrieval (IR) has attracted significant attention and demonstrated good performance in a wide variety of tasks over the years. In this article, we first present a comprehensive comparison among various topic modeling approaches, including the so-called document topic models (DTM) and word topic models (WTM), for Chinese spoken document retrieval (SDR). Moreover, in order to lessen SDR performance degradation when using imperfect recognition transcripts, we also leverage different levels of indexing features for topic modeling, including words, syllable-level units and their combinations. All the experiments are performed on the TDT Chinese collection.

原文英語
主出版物標題3rd Workshop on Searching Spontaneous Conversational Speech, SSCS'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09
頁面3-10
頁數8
DOIs
出版狀態已發佈 - 2009
事件3rd Workshop on Searching Spontaneous Conversational Speech, SSCS'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09 - Beijing, 中国
持續時間: 2009 10月 192009 10月 24

出版系列

名字3rd Workshop on Searching Spontaneous Conversational Speech, SSCS'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09

其他

其他3rd Workshop on Searching Spontaneous Conversational Speech, SSCS'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09
國家/地區中国
城市Beijing
期間2009/10/192009/10/24

ASJC Scopus subject areas

  • 電腦繪圖與電腦輔助設計
  • 軟體

指紋

深入研究「Topic modeling for spoken document retrieval using word- and syllable-level information」主題。共同形成了獨特的指紋。

引用此