Using corpus-based linguistic approaches in sense prediction study

Jia Fei Hong, Sue Jin Ker, Chu Ren Huang, Kathleen Ahrens

Research output: Chapter in Book/Report, Conference contribution

Abstract

In this study, we propose two corpus-based linguistic approaches to sense prediction. We concentrate on a character similarity clustering approach and a concept similarity clustering approach to predict the senses of non-assigned words, using corpora and tools such as the Chinese Gigaword Corpus and HowNet. We then evaluate the predictions against the sense divisions of Chinese Wordnet and Xiandai Hanyu Cidian. Using these resources, we determine the clusters of our four target words, chi1 "eat", wan2 "play", huan4 "change" and shao1 "burn", in order to predict all of their possible senses and evaluate them. This evaluation is intended to demonstrate the viability of the corpus-based approaches.

Original language: English
Title of host publication: PACLIC 24 - Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation
Pages: 399-407
Number of pages: 9
Publication status: Published - 1 Dec 2010
Externally published
Event: 24th Pacific Asia Conference on Language, Information and Computation, PACLIC 24 - Sendai, Japan
Duration: 4 Nov 2010 to 7 Nov 2010

Publication series

Name: PACLIC 24 - Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation

Conference

Conference: 24th Pacific Asia Conference on Language, Information and Computation, PACLIC 24
Country/Territory: Japan
City: Sendai
Period: 2010/11/04 to 2010/11/07

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science (miscellaneous)
