跳至主導覽 跳至搜尋 跳過主要內容

Dual-Model Prediction of Affective Engagement and Vocal Attractiveness From Speaker Expressiveness in Video Learning

  • Hung Yue Suen
  • , Kuo En Hung
  • , Fan Hsun Tseng*
  • *此作品的通信作者

研究成果: 雜誌貢獻期刊論文同行評審

摘要

This article outlines a machine learning-enabled speaker-centric emotion AI approach capable of predicting audience-affective engagement and vocal attractiveness in asynchronous video-based learning, relying solely on speaker-side affective expressions. Inspired by the demand for scalable, privacy-preserving affective computing applications, this speaker-centric emotion AI approach incorporates two distinct regression models that leverage a massive corpus developed within massive open online courses (MOOCs) to enable affectively engaging experiences. The regression model predicting affective engagement is developed by assimilating emotional expressions emanating from facial dynamics, oculomotor features, prosody, and cognitive semantics, while incorporating a second regression model to predict vocal attractiveness based exclusively on speaker-side acoustic features. Notably, on speaker-independent test sets, both regression models yielded impressive predictive performance (R2 = 0.85 for affective engagement and R2 = 0.88 for vocal attractiveness), confirming that speaker-side affect can functionally represent aggregated audience feedback. This article provides a speaker-centric Emotion AI approach substantiated by an empirical study discovering that speaker-side multimodal features, including acoustics, can prospectively forecast audience feedback without necessarily employing audience-side input information.

原文英語
頁(從 - 到)4111-4119
頁數9
期刊IEEE Transactions on Computational Social Systems
13
發行號3
DOIs
出版狀態已發佈 - 2026 6月 1

ASJC Scopus subject areas

  • 建模與模擬
  • 社會科學(雜項)
  • 人機介面

指紋

深入研究「Dual-Model Prediction of Affective Engagement and Vocal Attractiveness From Speaker Expressiveness in Video Learning」主題。共同形成了獨特的指紋。

引用此