Toward generic title generation for clustered documents

Yuen Hsien Tseng, Chi Jen Lin, Hsiu Han Chen, Yu I. Lin

Research output: Chapter in book/report, Conference contribution

18 Citations (Scopus)

Abstract

A cluster labeling algorithm is proposed that creates generic titles using external resources such as WordNet. Our method first extracts category-specific terms as cluster descriptors. These descriptors are then mapped to generic terms by a hypernym search algorithm. The proposed method has been evaluated on a patent document collection and a subset of the Reuters-21578 collection. Experimental results revealed that our method performs as anticipated. Real-case applications of these generic terms show promise in assisting humans in interpreting the clustered topics. Our method is general enough that it can easily be extended to use other hierarchical resources for adaptable label generation.
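
The descriptor-to-generic-term mapping described above can be illustrated with a minimal sketch. It does not reproduce the authors' algorithm: the toy hierarchy `TOY_HYPERNYMS` (a small dictionary standing in for WordNet), the helper names, and the scoring rule (cover the most descriptors, break ties by closeness to the descriptors) are all assumptions made for illustration.

```python
from collections import Counter

# Hypothetical toy hypernym hierarchy (child -> parent); a stand-in for WordNet.
TOY_HYPERNYMS = {
    "wheat": "grain",
    "corn": "grain",
    "rice": "grain",
    "grain": "food",
    "beef": "meat",
    "meat": "food",
    "food": "entity",
    "gold": "metal",
    "metal": "entity",
}


def hypernym_path(term, hypernyms):
    """Return the ancestors of `term`, nearest first (the term itself excluded)."""
    path = []
    seen = {term}
    while term in hypernyms:
        term = hypernyms[term]
        if term in seen:  # guard against cycles in a malformed hierarchy
            break
        seen.add(term)
        path.append(term)
    return path


def generic_label(descriptors, hypernyms):
    """Pick the ancestor covering the most descriptors (assumed scoring rule).

    Ties are broken in favour of the ancestor with the smallest total
    distance to the descriptors, i.e. the more specific term.
    Returns None when no descriptor has a known hypernym.
    """
    coverage = Counter()   # ancestor -> number of descriptors it covers
    distance = Counter()   # ancestor -> summed path length to those descriptors
    for term in descriptors:
        for steps, ancestor in enumerate(hypernym_path(term, hypernyms), start=1):
            coverage[ancestor] += 1
            distance[ancestor] += steps
    if not coverage:
        return None
    return min(coverage, key=lambda a: (-coverage[a], distance[a]))
```

Under these assumptions, `generic_label(["wheat", "corn", "rice"], TOY_HYPERNYMS)` returns `"grain"`, while the mixed cluster `["wheat", "beef"]` is labeled `"food"`, the nearest term that covers both descriptors.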

Original language: English
Title of host publication: Information Retrieval Technology - Third Asia Information Retrieval Symposium, AIRS 2006, Proceedings
Publisher: Springer Verlag
Pages: 145-157
Number of pages: 13
ISBN (Print): 3540457801, 9783540457800
DOIs: https://doi.org/10.1007/11880592_12
Publication status: Published - 2006
Event: 3rd Asia Information Retrieval Symposium, AIRS 2006 - Singapore, Singapore
Duration: 16 Oct 2006 to 18 Oct 2006

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 4182 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 3rd Asia Information Retrieval Symposium, AIRS 2006
Country: Singapore
City: Singapore
Period: 16 Oct 2006 to 18 Oct 2006

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science (all)

  • Cite this

    Tseng, Y. H., Lin, C. J., Chen, H. H., & Lin, Y. I. (2006). Toward generic title generation for clustered documents. In Information Retrieval Technology - Third Asia Information Retrieval Symposium, AIRS 2006, Proceedings (pp. 145-157). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4182 LNCS). Springer Verlag. https://doi.org/10.1007/11880592_12