Extracting the Semantic Representation of Chinese-Japanese Homophones with Word2Vec for Teaching Chinese as a Second/Foreign Language

Min Chi Lo, Li Yun Chang, Hou Chiang Tseng*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The purpose of this study was to quantify the semantic distance between Chinese-Japanese homophones, and to further examine whether the traditional classification of Chinese-Japanese homophones reflects varying semantic distances. Ultimately, this study aimed to offer corresponding teaching suggestions for teachers’ reference. We introduced a novel approach to classifying Chinese-Japanese homophones by combining the Revised Hierarchical Model with the Word2Vec technology to extract semantic representations and calculate the semantic distance of meanings in Chinese-Japanese homophones across “Same”, “Overlap”, and “Different” types. Through cluster analysis of semantic distances, we found that semantic distance indeed reflects the traditional classification of Chinese-Japanese homophones. The “Same” type had the average semantic distance of 0, showing the high degree of similarity between the meanings in Chinese and Japanese within this category. The “Different” type had the greatest average semantic distance, indicating the lowest similarity. The “Overlap” type had the average semantic distance between the “Same” and “Different” types, representing the complex nature of this category. Based on these findings, this study proposed teaching suggestions to help Chinese teachers determine the teaching sequence among Chinese-Japanese homophones.

Original languageEnglish
Title of host publicationInnovative Technologies and Learning - 7th International Conference, ICITL 2024, Proceedings
EditorsYu-Ping Cheng, Margus Pedaste, Emanuele Bardone, Yueh-Min Huang
PublisherSpringer Science and Business Media Deutschland GmbH
Pages271-280
Number of pages10
ISBN (Print)9783031658808
DOIs
Publication statusPublished - 2024
Event7th International Conference of Innovative Technologies and Learning, ICITL 2024 - Tartu, Estonia
Duration: 2024 Aug 142024 Aug 16

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume14785 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference7th International Conference of Innovative Technologies and Learning, ICITL 2024
Country/TerritoryEstonia
CityTartu
Period2024/08/142024/08/16

Keywords

  • Chinese-Japanese homophones
  • semantic representation
  • teaching Chinese as a second/foreign language
  • Word2Vec

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Extracting the Semantic Representation of Chinese-Japanese Homophones with Word2Vec for Teaching Chinese as a Second/Foreign Language'. Together they form a unique fingerprint.

Cite this