I-vector based language modeling for query representation

Kuan Yu Chen, Hsin Min Wang, Berlin Chen, Hsin His Chen

研究成果: 書貢獻/報告類型會議論文篇章

摘要

Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. Following the research tendency, many efforts have been devoted towards developing indexing and modeling techniques for representing spoken documents, but only few have been made on improving query formulation for better representing users' information needs. The i-vector based language modeling (IVLM) framework, stemming from the state-of-the-art i-vector framework for language identification and speaker recognition, has been proposed and formulated to represent documents in SDR with good promise recently. However, a major challenge of using IVLM for query modeling is that a query usually consists of only a few words; thus, it is hard to learn a reliable representation accordingly. In this paper, we focus our attention on query reformulation and propose three novel methods on top of IVLM to more accurately represent users' information needs. In addition, we also explore the use of multi-levels of index features, including word- and subword-level units, to work in concert with the proposed methods. A series of empirical SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the good effectiveness of our proposed methods as compared to existing state-of-the-art methods.

原文英語
主出版物標題2015 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Proceedings
發行者Institute of Electrical and Electronics Engineers Inc.
頁面5211-5215
頁數5
ISBN(電子)9781467369978
DOIs
出版狀態已發佈 - 2015 8月 4
事件40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015 - Brisbane, 澳大利亚
持續時間: 2014 4月 192014 4月 24

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
2015-August
ISSN(列印)1520-6149

其他

其他40th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2015
國家/地區澳大利亚
城市Brisbane
期間2014/04/192014/04/24

ASJC Scopus subject areas

  • 軟體
  • 訊號處理
  • 電氣與電子工程

指紋

深入研究「I-vector based language modeling for query representation」主題。共同形成了獨特的指紋。

引用此