TY - GEN
T1 - Latent topic modeling of word co-occurrence information for spoken document retrieval
AU - Chen, Berlin
PY - 2009
Y1 - 2009
N2 - In this paper, we present a word topic model (WTM) approach, discovering the co-occurrence relationship between words as well as the long-span latent topic information, for spoken document retrieval (SDR). A given document as a whole is modeled as a composite WTM model for generating an observed query. The underlying characteristics and different kinds of model structures are extensively investigated, while the performance of WTM is thoroughly analyzed and verified by comparison with a few existing retrieval models on the TDT-2 SDR task. We also attempt to incorporate part-of-speech (POS) weighting into the representations of the query observations and the WTM models for obtaining better retrieval performance.
AB - In this paper, we present a word topic model (WTM) approach, discovering the co-occurrence relationship between words as well as the long-span latent topic information, for spoken document retrieval (SDR). A given document as a whole is modeled as a composite WTM model for generating an observed query. The underlying characteristics and different kinds of model structures are extensively investigated, while the performance of WTM is thoroughly analyzed and verified by comparison with a few existing retrieval models on the TDT-2 SDR task. We also attempt to incorporate part-of-speech (POS) weighting into the representations of the query observations and the WTM models for obtaining better retrieval performance.
KW - Language model
KW - Probabilistic latent semantic analysis
KW - Spoken document retrieval
KW - Word topic model
UR - http://www.scopus.com/inward/record.url?scp=70349223893&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349223893&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2009.4960495
DO - 10.1109/ICASSP.2009.4960495
M3 - Conference contribution
AN - SCOPUS:70349223893
SN - 9781424423545
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 3961
EP - 3964
BT - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing - Proceedings, ICASSP 2009
T2 - 2009 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2009
Y2 - 19 April 2009 through 24 April 2009
ER -