An automatic collocation writing assistant for Taiwanese EFL learners: A case of corpus-based NLP technology

Yu Chia Chang, Jason S. Chang, Hao Jan Chen, Hsien Chin Liou

Research output: Contribution to journalArticle

42 Citations (Scopus)

Abstract

Previous work in the literature reveals that EFL learners were deficient in collocations that are a hallmark of near native fluency in learner's writing. Among different types of collocations, the verb-noun (V-N) one was found to be particularly difficult to master, and learners' first language was also found to heavily influence their collocation production. In this paper, we develop an online collocation aid for EFL writers in Taiwan, aiming at detecting and correcting of learners' miscollocations attributable to L1 interference. Relevant correct collocation as feedback messages is suggested according to the translation equivalents between learner's L1 and L2. The system utilizes natural language processing (NLP) techniques to segment sentences in order to extract V-N collocations in given texts, and to derive a list of candidate English verbs that share the same Chinese translations via consulting electronic bilingual dictionaries. After combining nouns with these derived candidate verbs as V-N pairs, the system makes use of a reference corpus to exclude the inappropriate V-N pairs and single out the proper collocations. The system can effectively pinpoint the miscollocations and provide the learner with adequate collocations that the learner intends to write but misuses. It is hoped that this online assistant can facilitate EFL learner-writers' collocation use and help them transfer this essential knowledge to their future writing.

Original languageEnglish
Pages (from-to)283-299
Number of pages17
JournalComputer Assisted Language Learning
Volume21
Issue number3
DOIs
Publication statusPublished - 2008 Jul 1

Fingerprint

Natural language processing systems
Glossaries
assistant
Feedback
candidacy
Processing
language
writer
management counsulting
dictionary
interference
Taiwan
electronics
Natural Language Processing
Collocation
EFL Learners
Corpus-based
Taiwanese
Nouns
Verbs

Keywords

  • Automatic error correction
  • L1 influence
  • Miscollocation analysis

ASJC Scopus subject areas

  • Language and Linguistics
  • Linguistics and Language
  • Computer Science Applications

Cite this

An automatic collocation writing assistant for Taiwanese EFL learners : A case of corpus-based NLP technology. / Chang, Yu Chia; Chang, Jason S.; Chen, Hao Jan; Liou, Hsien Chin.

In: Computer Assisted Language Learning, Vol. 21, No. 3, 01.07.2008, p. 283-299.

Research output: Contribution to journalArticle

@article{54e886ee0a92493b8a7277d688510b14,
title = "An automatic collocation writing assistant for Taiwanese EFL learners: A case of corpus-based NLP technology",
abstract = "Previous work in the literature reveals that EFL learners were deficient in collocations that are a hallmark of near native fluency in learner's writing. Among different types of collocations, the verb-noun (V-N) one was found to be particularly difficult to master, and learners' first language was also found to heavily influence their collocation production. In this paper, we develop an online collocation aid for EFL writers in Taiwan, aiming at detecting and correcting of learners' miscollocations attributable to L1 interference. Relevant correct collocation as feedback messages is suggested according to the translation equivalents between learner's L1 and L2. The system utilizes natural language processing (NLP) techniques to segment sentences in order to extract V-N collocations in given texts, and to derive a list of candidate English verbs that share the same Chinese translations via consulting electronic bilingual dictionaries. After combining nouns with these derived candidate verbs as V-N pairs, the system makes use of a reference corpus to exclude the inappropriate V-N pairs and single out the proper collocations. The system can effectively pinpoint the miscollocations and provide the learner with adequate collocations that the learner intends to write but misuses. It is hoped that this online assistant can facilitate EFL learner-writers' collocation use and help them transfer this essential knowledge to their future writing.",
keywords = "Automatic error correction, L1 influence, Miscollocation analysis",
author = "Chang, {Yu Chia} and Chang, {Jason S.} and Chen, {Hao Jan} and Liou, {Hsien Chin}",
year = "2008",
month = "7",
day = "1",
doi = "10.1080/09588220802090337",
language = "English",
volume = "21",
pages = "283--299",
journal = "Computer Assisted Language Learning",
issn = "0958-8221",
publisher = "Routledge",
number = "3",

}

TY - JOUR

T1 - An automatic collocation writing assistant for Taiwanese EFL learners

T2 - A case of corpus-based NLP technology

AU - Chang, Yu Chia

AU - Chang, Jason S.

AU - Chen, Hao Jan

AU - Liou, Hsien Chin

PY - 2008/7/1

Y1 - 2008/7/1

N2 - Previous work in the literature reveals that EFL learners were deficient in collocations that are a hallmark of near native fluency in learner's writing. Among different types of collocations, the verb-noun (V-N) one was found to be particularly difficult to master, and learners' first language was also found to heavily influence their collocation production. In this paper, we develop an online collocation aid for EFL writers in Taiwan, aiming at detecting and correcting of learners' miscollocations attributable to L1 interference. Relevant correct collocation as feedback messages is suggested according to the translation equivalents between learner's L1 and L2. The system utilizes natural language processing (NLP) techniques to segment sentences in order to extract V-N collocations in given texts, and to derive a list of candidate English verbs that share the same Chinese translations via consulting electronic bilingual dictionaries. After combining nouns with these derived candidate verbs as V-N pairs, the system makes use of a reference corpus to exclude the inappropriate V-N pairs and single out the proper collocations. The system can effectively pinpoint the miscollocations and provide the learner with adequate collocations that the learner intends to write but misuses. It is hoped that this online assistant can facilitate EFL learner-writers' collocation use and help them transfer this essential knowledge to their future writing.

AB - Previous work in the literature reveals that EFL learners were deficient in collocations that are a hallmark of near native fluency in learner's writing. Among different types of collocations, the verb-noun (V-N) one was found to be particularly difficult to master, and learners' first language was also found to heavily influence their collocation production. In this paper, we develop an online collocation aid for EFL writers in Taiwan, aiming at detecting and correcting of learners' miscollocations attributable to L1 interference. Relevant correct collocation as feedback messages is suggested according to the translation equivalents between learner's L1 and L2. The system utilizes natural language processing (NLP) techniques to segment sentences in order to extract V-N collocations in given texts, and to derive a list of candidate English verbs that share the same Chinese translations via consulting electronic bilingual dictionaries. After combining nouns with these derived candidate verbs as V-N pairs, the system makes use of a reference corpus to exclude the inappropriate V-N pairs and single out the proper collocations. The system can effectively pinpoint the miscollocations and provide the learner with adequate collocations that the learner intends to write but misuses. It is hoped that this online assistant can facilitate EFL learner-writers' collocation use and help them transfer this essential knowledge to their future writing.

KW - Automatic error correction

KW - L1 influence

KW - Miscollocation analysis

UR - http://www.scopus.com/inward/record.url?scp=45849125415&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=45849125415&partnerID=8YFLogxK

U2 - 10.1080/09588220802090337

DO - 10.1080/09588220802090337

M3 - Article

AN - SCOPUS:45849125415

VL - 21

SP - 283

EP - 299

JO - Computer Assisted Language Learning

JF - Computer Assisted Language Learning

SN - 0958-8221

IS - 3

ER -