TY - JOUR
T1 - Words, constructions and corpora
T2 - Network representations of constructional semantics for Mandarin space particles
AU - Chen, Alvin Cheng Hsien
N1 - Publisher Copyright:
© 2020 Alvin Cheng-Hsien Chen, published by De Gruyter, Berlin/Boston.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - In this study, we aim to demonstrate the effectiveness of network science in exploring the emergence of constructional semantics from the connectedness and relationships between linguistic units. With Mandarin locative constructions (MLCs) as a case study, we extracted constructional tokens from a representative corpus, including their respective space particles (SPs) and the head nouns of the landmarks (LMs), which constitute the nodes of the network. We computed edges based on the lexical similarities of word embeddings learned from large text corpora and the SP-LM contingency from collostructional analysis. We address three issues: (1) For each LM, how prototypical is it of the meaning of the SP? (2) For each SP, how semantically cohesive are its LM exemplars? (3) What are the emerging semantic fields from the constructional network of MLCs? We address these questions by examining the quantitative properties of the network at three levels: microscopic (i.e., node centrality and local clustering coefficient), mesoscopic (i.e., community) and macroscopic properties (i.e., small-worldness and scale-free). Our network analyses bring to the foreground the importance of repeated language experiences in the shaping and entrenchment of linguistic knowledge.
AB - In this study, we aim to demonstrate the effectiveness of network science in exploring the emergence of constructional semantics from the connectedness and relationships between linguistic units. With Mandarin locative constructions (MLCs) as a case study, we extracted constructional tokens from a representative corpus, including their respective space particles (SPs) and the head nouns of the landmarks (LMs), which constitute the nodes of the network. We computed edges based on the lexical similarities of word embeddings learned from large text corpora and the SP-LM contingency from collostructional analysis. We address three issues: (1) For each LM, how prototypical is it of the meaning of the SP? (2) For each SP, how semantically cohesive are its LM exemplars? (3) What are the emerging semantic fields from the constructional network of MLCs? We address these questions by examining the quantitative properties of the network at three levels: microscopic (i.e., node centrality and local clustering coefficient), mesoscopic (i.e., community) and macroscopic properties (i.e., small-worldness and scale-free). Our network analyses bring to the foreground the importance of repeated language experiences in the shaping and entrenchment of linguistic knowledge.
KW - collocation
KW - collostruction analysis
KW - construction grammar
KW - network analysis
KW - space particles
KW - usage-based grammar
UR - http://www.scopus.com/inward/record.url?scp=85093526595&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85093526595&partnerID=8YFLogxK
U2 - 10.1515/cllt-2020-0012
DO - 10.1515/cllt-2020-0012
M3 - Article
AN - SCOPUS:85093526595
SN - 1613-7027
VL - 18
SP - 209
EP - 235
JO - Corpus Linguistics and Linguistic Theory
JF - Corpus Linguistics and Linguistic Theory
IS - 2
ER -