TY - GEN
T1 - Query modeling for spoken document retrieval
AU - Chen, Berlin
AU - Chen, Pei Ning
AU - Chen, Kuan Yu
PY - 2011
Y1 - 2011
N2 - Spoken document retrieval (SDR) has recently become a more interesting research avenue due to increasing volumes of publicly available multimedia associated with speech information. Many efforts have been devoted to developing elaborate indexing and modeling techniques for representing spoken documents, but only few to improving query formulations for better representing the users' information needs. In view of this, we recently presented a language modeling framework exploring a novel use of relevance information cues for improving query effectiveness. Our work in this paper continues this general line of research in two main aspects. We further explore various ways to glean both relevance and non-relevance cues from the spoken document collection so as to enhance query modeling in an unsupervised fashion. Furthermore, we also investigate representing the query and documents with different granularities of index features to work in conjunction with the various relevance and/or non-relevance cues. Experiments conducted on the TDT (Topic Detection and Tracking) SDR task demonstrate the performance merits of the methods instantiated from our retrieval framework when compared to other existing retrieval methods.
AB - Spoken document retrieval (SDR) has recently become a more interesting research avenue due to increasing volumes of publicly available multimedia associated with speech information. Many efforts have been devoted to developing elaborate indexing and modeling techniques for representing spoken documents, but only few to improving query formulations for better representing the users' information needs. In view of this, we recently presented a language modeling framework exploring a novel use of relevance information cues for improving query effectiveness. Our work in this paper continues this general line of research in two main aspects. We further explore various ways to glean both relevance and non-relevance cues from the spoken document collection so as to enhance query modeling in an unsupervised fashion. Furthermore, we also investigate representing the query and documents with different granularities of index features to work in conjunction with the various relevance and/or non-relevance cues. Experiments conducted on the TDT (Topic Detection and Tracking) SDR task demonstrate the performance merits of the methods instantiated from our retrieval framework when compared to other existing retrieval methods.
UR - http://www.scopus.com/inward/record.url?scp=84858957387&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84858957387&partnerID=8YFLogxK
U2 - 10.1109/ASRU.2011.6163963
DO - 10.1109/ASRU.2011.6163963
M3 - Conference contribution
AN - SCOPUS:84858957387
SN - 9781467303675
T3 - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings
SP - 389
EP - 394
BT - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings
T2 - 2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011
Y2 - 11 December 2011 through 15 December 2011
ER -