TY - GEN
T1 - Content-based language models for spoken document retrieval
AU - Wang, Hsin Min
AU - Chen, Berlin
N1 - Publisher Copyright:
Copyright 2000 ACM.
PY - 2000/11/1
Y1 - 2000/11/1
N2 - Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper presents a novel concept of applying the content-based language models to spoken document retrieval. In an example task for retrieval of Mandarin broadcast news, the content-based language models either trained with the automatic transcriptions of the spoken documents or adapted from the baseline language models using the automatic transcriptions of the spoken documents were used to create the more accurate recognition results and indexing terms from both the spoken documents and the speech queries. We report on some interesting findings obtained in this research.
AB - Spoken document retrieval (SDR) has been extensively studied in recent years because of its potential use in navigating large multimedia collections in the near future. This paper presents a novel concept of applying the content-based language models to spoken document retrieval. In an example task for retrieval of Mandarin broadcast news, the content-based language models either trained with the automatic transcriptions of the spoken documents or adapted from the baseline language models using the automatic transcriptions of the spoken documents were used to create the more accurate recognition results and indexing terms from both the spoken documents and the speech queries. We report on some interesting findings obtained in this research.
KW - Content-based language models
KW - Speech recognition
KW - Spoken document retrieval (SDR)
UR - http://www.scopus.com/inward/record.url?scp=85027119404&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85027119404&partnerID=8YFLogxK
U2 - 10.1145/355214.355236
DO - 10.1145/355214.355236
M3 - Conference contribution
AN - SCOPUS:85027119404
T3 - Proceedings of the 5th international Workshop on Information Retrieval with Asian Languages, IRAL 2000
SP - 149
EP - 155
BT - Proceedings of the 5th international Workshop on Information Retrieval with Asian Languages, IRAL 2000
PB - Association for Computing Machinery, Inc
T2 - 5th International Workshop on Information Retrieval with Asian Languages, IRAL 2000
Y2 - 30 September 2000 through 1 October 2000
ER -