Abstract
This paper presents an improved framework for voice retrieval of Mandarin broadcast news speech. First, several unsupervised and data-driven approaches for broadcast news transcription were proposed to improve the speech recognition accuracy and efficiency. Then, a multiscale indexing paradigm for broadcast news retrieval was exploited to alleviate the problems caused by the speech recognition errors and the flexible wording structure of the Chinese language. Finally, we used the PDA as the platform and broadcast radio programs collected in Taiwan as the document collection to establish a speech-based multimedia information retrieval prototype system. Very encouraging results were obtained.
Original language | English |
---|---|
Pages (from-to) | 91-109 |
Number of pages | 19 |
Journal | International Journal of Pattern Recognition and Artificial Intelligence |
Volume | 20 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2006 Feb |
Keywords
- Broadcast news
- Information retrieval
- Multimedia
- Multiscale indexing
- Speech recognition
ASJC Scopus subject areas
- Software
- Computer Vision and Pattern Recognition
- Artificial Intelligence