Word topical mixture models for extractive spoken document summarization

Berlin Chen*, Yi Ting Chen

*此作品的通信作者

研究成果: 書貢獻/報告類型會議論文篇章

5 引文 斯高帕斯(Scopus)

摘要

This paper considers extractive summarization of Chinese spoken documents. In contrast to conventional approaches, we attempt to deal with the extractive summarization problem under a probabilistic generative framework. A word topical mixture model (w-TMM) was proposed to explore the cooccurrence relationship between words of the language. Each sentence of the spoken document to be summarized was treated as a composite word TMM model for generating the document, and sentences were ranked and selected according to their likelihoods. Various kinds of modeling structures and learning approaches were extensively investigated. In addition, the summarization capabilities were verified by comparison with the other conventional summarization approaches. The experiments were performed on the Chinese broadcast news collected in Taiwan. Noticeable performance gains were obtained. The proposed summarization technique has also been properly integrated into our prototype system for voice retrieval of broadcast news via mobile devices.

原文英語
主出版物標題Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007
發行者IEEE Computer Society
頁面52-55
頁數4
ISBN(列印)1424410177, 9781424410170
DOIs
出版狀態已發佈 - 2007
事件IEEE International Conference onMultimedia and Expo, ICME 2007 - Beijing, 中国
持續時間: 2007 7月 22007 7月 5

出版系列

名字Proceedings of the 2007 IEEE International Conference on Multimedia and Expo, ICME 2007

其他

其他IEEE International Conference onMultimedia and Expo, ICME 2007
國家/地區中国
城市Beijing
期間2007/07/022007/07/05

ASJC Scopus subject areas

  • 電腦繪圖與電腦輔助設計
  • 軟體

指紋

深入研究「Word topical mixture models for extractive spoken document summarization」主題。共同形成了獨特的指紋。

引用此