TY - GEN
T1 - Statistical language model adaptation for Mandarin broadcast news transcription
AU - Chen, Berlin
AU - Tsai, Wen Hung
AU - Kuo, Jen Wei
PY - 2004
Y1 - 2004
N2 - This paper investigates statistical language model adaptation for Mandarin broadcast news transcription. A topical mixture model was proposed to explore the long-span latent topical information for dynamic language model adaptation. The underlying characteristics and various kinds of model complexities were extensively investigated, while their performance was verified by comparison with the conventional MAP-based adaptation approaches, which are devoted to extracting the short-span n-gram information. The speech recognition experiments were conducted on the broadcast news collected in Taiwan. Very promising results in both perplexity and word error rate reductions were initially obtained.
AB - This paper investigates statistical language model adaptation for Mandarin broadcast news transcription. A topical mixture model was proposed to explore the long-span latent topical information for dynamic language model adaptation. The underlying characteristics and various kinds of model complexities were extensively investigated, while their performance was verified by comparison with the conventional MAP-based adaptation approaches, which are devoted to extracting the short-span n-gram information. The speech recognition experiments were conducted on the broadcast news collected in Taiwan. Very promising results in both perplexity and word error rate reductions were initially obtained.
UR - http://www.scopus.com/inward/record.url?scp=21444452767&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=21444452767&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:21444452767
SN - 0780386787
SN - 9780780386785
T3 - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
SP - 313
EP - 316
BT - 2004 International Symposium on Chinese Spoken Language Processing - Proceedings
T2 - 2004 International Symposium on Chinese Spoken Language Processing
Y2 - 15 December 2004 through 18 December 2004
ER -