CRIE: An automated analyzer for Chinese texts

Yao Ting Sung, Tao Hsing Chang, Wei Chun Lin, Kuan Sheng Hsieh, Kuo En Chang

Research output: Contribution to journalArticle

6 Citations (Scopus)

Abstract

Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

Original languageEnglish
Pages (from-to)1238-1251
Number of pages14
JournalBehavior Research Methods
Volume48
Issue number4
DOIs
Publication statusPublished - 2016 Dec 1

Fingerprint

Language
Linguistics
Language Arts
Learning
Readability
Linguistic Features
Leveling
Machine Learning

Keywords

  • CRIE
  • Chinese text analysis
  • Linguistic feature
  • Readability

ASJC Scopus subject areas

  • Experimental and Cognitive Psychology
  • Developmental and Educational Psychology
  • Arts and Humanities (miscellaneous)
  • Psychology (miscellaneous)
  • Psychology(all)

Cite this

CRIE : An automated analyzer for Chinese texts. / Sung, Yao Ting; Chang, Tao Hsing; Lin, Wei Chun; Hsieh, Kuan Sheng; Chang, Kuo En.

In: Behavior Research Methods, Vol. 48, No. 4, 01.12.2016, p. 1238-1251.

Research output: Contribution to journalArticle

Sung, Yao Ting ; Chang, Tao Hsing ; Lin, Wei Chun ; Hsieh, Kuan Sheng ; Chang, Kuo En. / CRIE : An automated analyzer for Chinese texts. In: Behavior Research Methods. 2016 ; Vol. 48, No. 4. pp. 1238-1251.
@article{254de07521664c73a2b703244e78f98c,
title = "CRIE: An automated analyzer for Chinese texts",
abstract = "Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.",
keywords = "CRIE, Chinese text analysis, Linguistic feature, Readability",
author = "Sung, {Yao Ting} and Chang, {Tao Hsing} and Lin, {Wei Chun} and Hsieh, {Kuan Sheng} and Chang, {Kuo En}",
year = "2016",
month = "12",
day = "1",
doi = "10.3758/s13428-015-0649-1",
language = "English",
volume = "48",
pages = "1238--1251",
journal = "Behavior Research Methods",
issn = "1069-9384",
publisher = "Springer New York",
number = "4",

}

TY - JOUR

T1 - CRIE

T2 - An automated analyzer for Chinese texts

AU - Sung, Yao Ting

AU - Chang, Tao Hsing

AU - Lin, Wei Chun

AU - Hsieh, Kuan Sheng

AU - Chang, Kuo En

PY - 2016/12/1

Y1 - 2016/12/1

N2 - Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

AB - Textual analysis has been applied to various fields, such as discourse analysis, corpus studies, text leveling, and automated essay evaluation. Several tools have been developed for analyzing texts written in alphabetic languages such as English and Spanish. However, currently there is no tool available for analyzing Chinese-language texts. This article introduces a tool for the automated analysis of simplified and traditional Chinese texts, called the Chinese Readability Index Explorer (CRIE). Composed of four subsystems and incorporating 82 multilevel linguistic features, CRIE is able to conduct the major tasks of segmentation, syntactic parsing, and feature extraction. Furthermore, the integration of linguistic features with machine learning models enables CRIE to provide leveling and diagnostic information for texts in language arts, texts for learning Chinese as a foreign language, and texts with domain knowledge. The usage and validation of the functions provided by CRIE are also introduced.

KW - CRIE

KW - Chinese text analysis

KW - Linguistic feature

KW - Readability

UR - http://www.scopus.com/inward/record.url?scp=84944704384&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84944704384&partnerID=8YFLogxK

U2 - 10.3758/s13428-015-0649-1

DO - 10.3758/s13428-015-0649-1

M3 - Article

C2 - 26424442

AN - SCOPUS:84944704384

VL - 48

SP - 1238

EP - 1251

JO - Behavior Research Methods

JF - Behavior Research Methods

SN - 1069-9384

IS - 4

ER -