This paper presents a system for speech retrieval of Mandarin broadcast news. First, several data-driven and unsupervised approaches are integrated into the broadcast news transcription system to improve the speech recognition accuracy and efficiency. Then, a multi-scale indexing paradigm for broadcast news retrieval is proposed to make use of the special structural properties of the Chinese language as well as to alleviate the problems caused by the speech recognition errors. Finally, we use the PDA as the platform and Mandarin broadcast news stories collected in Taiwan as the document collection to establish a speech-based multimedia information retrieval prototype system. Very encouraging results are obtained.
|已發佈 - 2005
|9th European Conference on Speech Communication and Technology - Lisbon, 葡萄牙
持續時間: 2005 9月 4 → 2005 9月 8
|9th European Conference on Speech Communication and Technology
|2005/09/04 → 2005/09/08
ASJC Scopus subject areas