TY - JOUR
T1 - Extractive spoken document summarization for information retrieval
AU - Chen, Berlin
AU - Chen, Yi Ting
PY - 2008/3/1
Y1 - 2008/3/1
N2 - The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In this paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with several conventional spoken document summarization models. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained. The proposed summarization technique has also been properly integrated into our prototype system for voice retrieval of Mandarin broadcast news via mobile devices.
AB - The purpose of extractive summarization is to automatically select a number of indicative sentences, passages, or paragraphs from the original document according to a target summarization ratio and then sequence them to form a concise summary. In this paper, we proposed the use of probabilistic latent topical information for extractive summarization of spoken documents. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with several conventional spoken document summarization models. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained. The proposed summarization technique has also been properly integrated into our prototype system for voice retrieval of Mandarin broadcast news via mobile devices.
KW - Extractive summarization
KW - Information retrieval
KW - Speech recognition
KW - Spoken documents
KW - Topical mixture model
UR - http://www.scopus.com/inward/record.url?scp=38349073318&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=38349073318&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2007.10.022
DO - 10.1016/j.patrec.2007.10.022
M3 - Article
AN - SCOPUS:38349073318
SN - 0167-8655
VL - 29
SP - 426
EP - 437
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
IS - 4
ER -