摘要
Studies of cross-strait lexical contrasts in the use of Mandarin Chinese reveal that a divergence has become increasingly evident. This divergence is apparent in phonological, semantic, and pragmatic analyses and has become an obstacle to knowledge-sharing and information exchange. Given the wide range of divergences, it seems that Chinese character forms offer the most reliable regular mapping between cross-strait usage contrasts. We propose a new approach to discovery of cross-strait contrasts in this paper anchored on the regular correspondences of characters. Our approach is corpus-based and collocation-driven. We use known contrast pairs as seeds in a corpus containing data from both the PRC and Taiwan. Collocation patterns in terms of both lexical lists and grammatical functions of these contrast pairs are studied to semi-automatically discover additional contrast pairs. This approach obtains both NLP applicability and linguistic felicity since it yields both the contrast pairs as well as their usage contexts.
原文 | 英語 |
---|---|
頁(從 - 到) | 221-238 |
頁數 | 18 |
期刊 | Language and Linguistics |
卷 | 9 |
發行號 | 2 |
出版狀態 | 已發佈 - 2008 |
對外發佈 | 是 |
ASJC Scopus subject areas
- 語言與語言學
- 語言和語言學