Developing learner corpus annotation for Chinese grammatical errors

Lung Hao Lee, Li Ping Chang, Yuen Hsien Tseng

研究成果: 書貢獻/報告類型會議論文篇章

12 引文 斯高帕斯(Scopus)

摘要

This study describes the construction of the TOCFL (Test Of Chinese as a Foreign Language) learner corpus, including the collection and grammatical error annotation of 2,837 essays written by Chinese language learners originating from a total of 46 different mother-Tongue languages. We propose hierarchical tagging sets to manually annotate grammatical errors, resulting in 33,835 inappropriate usages. Our built corpus has been provided for the shared tasks on Chinese grammatical error diagnosis. These demonstrate the usability of our learner corpus annotation.

原文英語
主出版物標題Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016
編輯Minghui Dong, Chung-Hsien Wu, Yanfeng Lu, Haizhou Li, Yuen-Hsien Tseng, Liang-Chih Yu, Lung-Hao Lee
發行者Institute of Electrical and Electronics Engineers Inc.
頁面254-257
頁數4
ISBN(電子)9781509009213
DOIs
出版狀態已發佈 - 2017 3月 10
事件20th International Conference on Asian Language Processing, IALP 2016 - Tainan, 臺灣
持續時間: 2016 11月 212016 11月 23

出版系列

名字Proceedings of the 2016 International Conference on Asian Language Processing, IALP 2016

其他

其他20th International Conference on Asian Language Processing, IALP 2016
國家/地區臺灣
城市Tainan
期間2016/11/212016/11/23

ASJC Scopus subject areas

  • 訊號處理
  • 電腦視覺和模式識別
  • 語言和語言學
  • 人工智慧

指紋

深入研究「Developing learner corpus annotation for Chinese grammatical errors」主題。共同形成了獨特的指紋。

引用此