Introduction to a proofreading tool for chinese spelling check task of SIGHAN-8

Tao Hsing Chang, Cheng Han Yang, Hsueh Chih Chen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

The detection and correction of erroneous Chinese characters is an important problem in many applications. This paper proposed an automatic method for correcting erroneous Chinese characters. The method is divided into two parts, which separately handle two types of erroneous character: The occurrence of an erroneous character in a word length of one, and the occurrence in a word length of two or more. The first primarily makes use of a rulesbased method, while the second integrates parameters of similarity and syntax rationality using a linear regression model to predict erroneous characters. Experimental results shown that the F1 and FPR of the proposed method are 0.34 and 0.18 respectively.

Original languageEnglish
Title of host publicationProceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015
EditorsLiang-Chih Yu, Zhifang Sui, Yue Zhang, Vincent Ng
PublisherAssociation for Computational Linguistics (ACL)
Pages50-55
Number of pages6
ISBN (Electronic)9781941643570
Publication statusPublished - 2015
Event8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - Beijing, China
Duration: 2015 Jul 302015 Jul 31

Publication series

NameProceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015

Conference

Conference8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015
Country/TerritoryChina
CityBeijing
Period2015/07/302015/07/31

ASJC Scopus subject areas

  • Language and Linguistics
  • Computer Science Applications
  • Education
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Introduction to a proofreading tool for chinese spelling check task of SIGHAN-8'. Together they form a unique fingerprint.

Cite this