TY - JOUR
T1 - A novel real-time speech summarizer system for the learning of sustainability
AU - Wang, Hsiu Wen
AU - Cheng, Ding Yuan
AU - Chen, Chi Hua
AU - Wu, Yu Rou
AU - Lo, Chi Chun
AU - Lin, Hui Fei
N1 - Publisher Copyright:
© 2015 by the authors.
PY - 2015
Y1 - 2015
N2 - As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase real-time speech summarizer for the learning of sustainability (RTSSLS) was proposed. Phase One involved selecting independent features (e.g., centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary. In practical results, the accuracies of macro-average and micro-average for the RTSSLS were 70% and 73%, respectively. Therefore, using a RTSSLS can enable users to acquire key speech information for the learning of sustainability.
AB - As the number of speech and video documents increases on the Internet and portable devices proliferate, speech summarization becomes increasingly essential. Relevant research in this domain has typically focused on broadcasts and news; however, the automatic summarization methods used in the past may not apply to other speech domains (e.g., speech in lectures). Therefore, this study explores the lecture speech domain. The features used in previous research were analyzed and suitable features were selected following experimentation; subsequently, a three-phase real-time speech summarizer for the learning of sustainability (RTSSLS) was proposed. Phase One involved selecting independent features (e.g., centrality, resemblance to the title, sentence length, term frequency, and thematic words) and calculating the independent feature scores; Phase Two involved calculating the dependent features, such as the position compared with the independent feature scores; and Phase Three involved comparing these feature scores to obtain weighted averages of the function-scores, determine the highest-scoring sentence, and provide a summary. In practical results, the accuracies of macro-average and micro-average for the RTSSLS were 70% and 73%, respectively. Therefore, using a RTSSLS can enable users to acquire key speech information for the learning of sustainability.
KW - Feature selection
KW - Information retrieval
KW - Speech summarization
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=84928974820&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84928974820&partnerID=8YFLogxK
U2 - 10.3390/su7043885
DO - 10.3390/su7043885
M3 - Article
AN - SCOPUS:84928974820
SN - 2071-1050
VL - 7
SP - 3885
EP - 3899
JO - Sustainability (Switzerland)
JF - Sustainability (Switzerland)
IS - 4
ER -