特徵選取演算法對可讀性模型的影響

Tsai Ning Tai, Hou Chiang Tseng, Yao Ting Sung

研究成果: 書貢獻/報告類型會議論文篇章

摘要

Reading is one of the most important ways of acquiring knowledge. Researchers have pointed out that to promote the effectiveness of reading, it is very important to provide materials of the right level of difficulty. If the reading materials are too easy, readers usually cannot acquire new knowledge in the process of reading; on the other hand, if the materials are too difficult, it will cause excessive cognitive burden to the readers, affecting their learning effectiveness. Therefore, giving readers appropriate reading is an important issue. To address this issue, many scholars have begun to develop readability models and found that feature selection enhances the accuracy of readability models. However, the interaction between various feature algorithms and classifiers has yet to be much explored in past studies. Therefore, in this study, three feature selection algorithms, Chi-squared test, ANOVA, Mutual Information, and 25 classifiers, were applied to compare the accuracy of readability models for grades 1-12 in the textbooks of the Chinese language. The experimental results show the feature selection algorithm and the paired classifiers with the highest accuracy. This study found that using ANOVA as the feature selection algorithm and LGBM as the classifier can have 48% accuracy, 73% adjacent accuracy, and 85% reduction in the number of features.

貢獻的翻譯標題Impact of Feature Selection Algorithms on Readability Model
原文繁體中文
主出版物標題ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing
編輯Jheng-Long Wu, Ming-Hsiang Su, Hen-Hsen Huang, Yu Tsao, Hou-Chiang Tseng, Chia-Hui Chang, Lung-Hao Lee, Yuan-Fu Liao, Wei-Yun Ma
發行者The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
頁面106-115
頁數10
ISBN(電子)9789869576963
出版狀態已發佈 - 2023
事件35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023 - Taipei City, 臺灣
持續時間: 2023 10月 202023 10月 21

出版系列

名字ROCLING 2023 - Proceedings of the 35th Conference on Computational Linguistics and Speech Processing

會議

會議35th Conference on Computational Linguistics and Speech Processing, ROCLING 2023
國家/地區臺灣
城市Taipei City
期間2023/10/202023/10/21

Keywords

  • Chinese Readability
  • Classifier
  • Feature Selection
  • Machine Learning

ASJC Scopus subject areas

  • 語言與語言學
  • 言語和聽力

指紋

深入研究「特徵選取演算法對可讀性模型的影響」主題。共同形成了獨特的指紋。

引用此