TY - JOUR
T1 - Discriminating capabilities of syllable-based features and approaches of utilizing them for voice retrieval of speech information in Mandarin Chinese
AU - Chen, Berlin
AU - Wang, Hsin Min
AU - Lee, Lin Shan
PY - 2002/7
Y1 - 2002/7
N2 - With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and more important. In this paper, considering the monosyllabic structure of the Chinese language, a whole class of syllable-based indexing features, including overlapping segments of syllables and Syllable pairs separated by a few syllables, is extensively investigated based on a Mandarin broadcast news database. The strong discriminating capabilities of such syllable-based features were verified by comparing with the word- or character-based features. Good approaches for better utilizing such capabilities, including fusion with the word- and character-level information and improved approaches to obtain better syllable-based features and query expressions, were extensively investigated. Very encouraging experimental results were obtained.
AB - With the rapidly growing use of the audio and multimedia information over the Internet, the technology for retrieving speech information using voice queries is becoming more and more important. In this paper, considering the monosyllabic structure of the Chinese language, a whole class of syllable-based indexing features, including overlapping segments of syllables and Syllable pairs separated by a few syllables, is extensively investigated based on a Mandarin broadcast news database. The strong discriminating capabilities of such syllable-based features were verified by comparing with the word- or character-based features. Good approaches for better utilizing such capabilities, including fusion with the word- and character-level information and improved approaches to obtain better syllable-based features and query expressions, were extensively investigated. Very encouraging experimental results were obtained.
KW - Confidence measure
KW - Retrieval of speech information
KW - Syllable-based features
KW - Term association matrix
UR - http://www.scopus.com/inward/record.url?scp=0036649836&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0036649836&partnerID=8YFLogxK
U2 - 10.1109/TSA.2002.802541
DO - 10.1109/TSA.2002.802541
M3 - Article
AN - SCOPUS:0036649836
SN - 1063-6676
VL - 10
SP - 303
EP - 314
JO - IEEE Transactions on Speech and Audio Processing
JF - IEEE Transactions on Speech and Audio Processing
IS - 5
ER -