A corpus-based approach to the discovery of cross-strait lexical contrasts

Jia Fei Hong*, Chu Ren Huang


研究成果: 雜誌貢獻期刊論文同行評審

8 引文 斯高帕斯(Scopus)


Studies of cross-strait lexical contrasts in the use of Mandarin Chinese reveal that a divergence has become increasingly evident. This divergence is apparent in phonological, semantic, and pragmatic analyses and has become an obstacle to knowledge-sharing and information exchange. Given the wide range of divergences, it seems that Chinese character forms offer the most reliable regular mapping between cross-strait usage contrasts. We propose a new approach to discovery of cross-strait contrasts in this paper anchored on the regular correspondences of characters. Our approach is corpus-based and collocation-driven. We use known contrast pairs as seeds in a corpus containing data from both the PRC and Taiwan. Collocation patterns in terms of both lexical lists and grammatical functions of these contrast pairs are studied to semi-automatically discover additional contrast pairs. This approach obtains both NLP applicability and linguistic felicity since it yields both the contrast pairs as well as their usage contexts.

頁(從 - 到)221-238
期刊Language and Linguistics
出版狀態已發佈 - 2008

ASJC Scopus subject areas

  • 語言與語言學
  • 語言和語言學


深入研究「A corpus-based approach to the discovery of cross-strait lexical contrasts」主題。共同形成了獨特的指紋。