New word extraction utilizing Google news corpuses for supporting lexicon-based Chinese word segmentation systems

Chin Ming Hong*, Chih Ming Chen, Chao Yang Chiu

*Corresponding author for this work

Research output: Chapter in book/report › Conference paper

4 Citations (Scopus)

Abstract

This study proposes a novel statistics-based scheme for extracting new words from Google News in order to improve the word identification ability of lexicon-based Chinese word segmentation systems. Extracting news words from news corpora and incrementally adding them to the lexicon of such a system offers two benefits: a professional news lexicon is constructed automatically, and word identification ability is enhanced. Compared with another proposed method, the experimental results indicate that the proposed scheme not only retrieves news words more accurately from the categorized corpora of Google News, but also obtains a larger number of new words.

Original language: English
Title of host publication: International Joint Conference on Neural Networks 2006, IJCNN '06
Pages: 3040-3046
Number of pages: 7
Publication status: Published - 2006
Event: International Joint Conference on Neural Networks 2006, IJCNN '06 - Vancouver, BC, Canada
Duration: 16 Jul 2006 to 21 Jul 2006

Publication series

Name: IEEE International Conference on Neural Networks - Conference Proceedings
ISSN (Print): 1098-7576

Other

Other: International Joint Conference on Neural Networks 2006, IJCNN '06
Country/Territory: Canada
City: Vancouver, BC
Period: 2006/07/16 to 2006/07/21

ASJC Scopus subject areas

  • Software
