I-vector based language modeling for spoken document retrieval

Kuan Yu Chen, Hung Shin Lee, Hsin Min Wang, Berlin Chen, Hsin His Chen

研究成果: 書貢獻/報告類型會議貢獻

14 引文 斯高帕斯(Scopus)

摘要

Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to reduce a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, and then numbers of well-developed postprocessing techniques (such as probabilistic linear discriminative analysis, PLDA) can be readily and effectively used. However, to our best knowledge, there is no research up to date on applying the i-vector framework for SDR or information retrieval (IR). In this paper, we make a step forward to formulate an i-vector based language modeling (IVLM) framework for SDR. Furthermore, we evaluate the proposed IVLM framework with both inductive and transductive learning strategies. We also exploit multi-levels of index features, including word- and subword-level units, in concert with the proposed framework. The results of SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the performance merits of our proposed framework when compared to several existing approaches.

原文英語
主出版物標題2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
發行者Institute of Electrical and Electronics Engineers Inc.
頁面7083-7087
頁數5
ISBN(列印)9781479928927
DOIs
出版狀態已發佈 - 2014 一月 1
事件2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014 - Florence, 意大利
持續時間: 2014 五月 42014 五月 9

出版系列

名字ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN(列印)1520-6149

其他

其他2014 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2014
國家意大利
城市Florence
期間14/5/414/5/9

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Electrical and Electronic Engineering

指紋 深入研究「I-vector based language modeling for spoken document retrieval」主題。共同形成了獨特的指紋。

引用此