Spoken document understanding and organization

Lin Shan Lee, Berlin Chen

研究成果: 雜誌貢獻期刊論文同行評審

118 引文 斯高帕斯(Scopus)

摘要

Spoken documents associated with the network content are critical for retrieval and browsing. An overview is given of various technology areas reaching towards this goal in a unified context. Technology areas covered include named-entity (NE) extraction, segmentation, and information extraction for the spoken documents as well as automatic summarization, title generation, and topic analysis and organization. The relevant problems and issues, general principles, and basic approaches for each area are briefly reviewed. A framework for properly integrating all these different technology areas is proposed, in which four different levels of processes are defined and bottom-up and top-down relationships are discussed. An initial prototype system for such purposes has been developed by National Taiwan University. The resultant system used broadcast news in Mandarin Chinese as example spoken documents.

原文英語
頁(從 - 到)42-60
頁數19
期刊IEEE Signal Processing Magazine
22
發行號5
DOIs
出版狀態已發佈 - 2005 一月 1

ASJC Scopus subject areas

  • Signal Processing
  • Electrical and Electronic Engineering
  • Applied Mathematics

指紋 深入研究「Spoken document understanding and organization」主題。共同形成了獨特的指紋。

引用此