Statistical Chinese spoken document retrieval using latent topical information

Berlin Chen, Jen Wei Kuo, Yao Min Huang, Hsin Min Wang

研究成果: 會議貢獻類型

4 引文 斯高帕斯(Scopus)

摘要

Information retrieval which aims to provide people with easy access to all kinds of information is now becoming more and more emphasized. However, most approaches to information retrieval are primarily based on literal term matching and operate in a deterministic manner. Thus their performance is often limited due to the problems of vocabulary mismatch and not able to be steadily improved through use. In order to overcome these drawbacks as well as to enhance the retrieval performance, in this paper we explore the use of topical mixture model for statistical Chinese spoken document retrieval. Various kinds of model structures and learning approaches were extensively investigated. In addition, the retrieval capabilities were verified by comparison with the conventional vector space model and latent semantic indexing model, as well as our previously presented HMM/N-gram retrieval model. The experiments were performed on the TDT-2 Chinese collection. Noticeable improvements in retrieval performance were obtained.

原文英語
頁面1621-1624
頁數4
出版狀態已發佈 - 2004 一月 1
事件8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, 大韓民國
持續時間: 2004 十月 42004 十月 8

其他

其他8th International Conference on Spoken Language Processing, ICSLP 2004
國家大韓民國
城市Jeju, Jeju Island
期間04/10/404/10/8

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language

指紋 深入研究「Statistical Chinese spoken document retrieval using latent topical information」主題。共同形成了獨特的指紋。

  • 引用此

    Chen, B., Kuo, J. W., Huang, Y. M., & Wang, H. M. (2004). Statistical Chinese spoken document retrieval using latent topical information. 1621-1624. 論文發表於 8th International Conference on Spoken Language Processing, ICSLP 2004, Jeju, Jeju Island, 大韓民國.