This paper presents a system for speech retrieval of Mandarin broadcast news. First, several data-driven and unsupervised approaches are integrated into the broadcast news transcription system to improve the speech recognition accuracy and efficiency. Then, a multi-scale indexing paradigm for broadcast news retrieval is proposed to make use of the special structural properties of the Chinese language as well as to alleviate the problems caused by the speech recognition errors. Finally, we use the PDA as the platform and Mandarin broadcast news stories collected in Taiwan as the document collection to establish a speech-based multimedia information retrieval prototype system. Very encouraging results are obtained.
|出版狀態||已發佈 - 2005 十二月 1|
|事件||9th European Conference on Speech Communication and Technology - Lisbon, 葡萄牙|
持續時間: 2005 九月 4 → 2005 九月 8
|其他||9th European Conference on Speech Communication and Technology|
|期間||2005/09/04 → 2005/09/08|
ASJC Scopus subject areas