Abstract
Users on Twitter often post various tweets describing their moods. Analyzing these tweets can aid in understanding an individual’s psychological state, which will be beneficial to research aimed at promoting public mental health. This study intends to perform automatic classification on tweets related to mental health literacy. Techniques including traditional machine learning as well as AI technologies like BERT, SetFit, GPT-3, and GPT-4 are used to automatically classify them into 11 items across five dimensions, with each item having five related intensity scores. The goal is to achieve a machine prediction effectiveness of over 0.8 with limited human-annotated training data, to ensure the machine can effectively assist in mental health research. The results show that using SetFit, most items can achieve a Macro F1 score of about 0.8, with only two items scoring around 0.65. The contribution of this study lies in presenting and comparing the effectiveness of various natural language processing paradigms in text comprehension and analysis on these difficult tasks.
Translated title of the contribution | 心理健康素養推文自動分類之研究 |
---|---|
Original language | English |
Pages (from-to) | 5-27 |
Number of pages | 23 |
Journal | Journal of Educational Media and Library Sciences |
Volume | 61 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2024 |
Keywords
- Automatic classification
- Deep learning
- Few-shot fine-tuning
- Machine learning
- 少樣本微調
- 機器學習
- 深度學習
- 自動分類
ASJC Scopus subject areas
- Conservation
- Information Systems
- Archaeology
- Library and Information Sciences