TY - JOUR
T1 - Leveraging Kullback-Leibler divergence measures and information-rich cues for speech summarization
AU - Lin, Shih Hsiang
AU - Yeh, Yao Ming
AU - Chen, Berlin
N1 - Funding Information: Manuscript received January 26, 2010; revised May 12, 2010; accepted July 21, 2010. Date of publication August 16, 2010; date of current version March 30, 2011. This work was supported in part by the National Science Council, Taiwan, under Grants NSC98-2221-E-003-011-MY3, NSC 99-2515-S-003-004, and NSC98-2631-S-003-002 and by National Taiwan Normal University under Grant 99T3060-1. The associate editor coordinating the review of this manuscript and approving it for publication was Dr. Gokhan Tur.
PY - 2011
Y1 - 2011
N2 - Imperfect speech recognition often leads to degraded performance when exploiting conventional text-based methods for speech summarization. To alleviate this problem, this paper investigates various ways to robustly represent the recognition hypotheses of spoken documents beyond the top-scoring ones. Moreover, a summarization framework, building on the Kullback-Leibler (KL) divergence measure and exploring both the relevance and topical information cues of spoken documents and sentences, is presented to work with such robust representations. Experiments on broadcast news speech summarization tasks appear to demonstrate the utility of the presented approaches.
AB - Imperfect speech recognition often leads to degraded performance when exploiting conventional text-based methods for speech summarization. To alleviate this problem, this paper investigates various ways to robustly represent the recognition hypotheses of spoken documents beyond the top-scoring ones. Moreover, a summarization framework, building on the Kullback-Leibler (KL) divergence measure and exploring both the relevance and topical information cues of spoken documents and sentences, is presented to work with such robust representations. Experiments on broadcast news speech summarization tasks appear to demonstrate the utility of the presented approaches.
KW - Kullback-Leibler (KL) divergence
KW - multiple recognition hypotheses
KW - relevance information
KW - speech summarization
KW - topical information
UR - http://www.scopus.com/inward/record.url?scp=79953287641&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79953287641&partnerID=8YFLogxK
U2 - 10.1109/TASL.2010.2066268
DO - 10.1109/TASL.2010.2066268
M3 - Article
AN - SCOPUS:79953287641
SN - 1558-7916
VL - 19
SP - 871
EP - 882
JO - IEEE Transactions on Audio, Speech and Language Processing
JF - IEEE Transactions on Audio, Speech and Language Processing
IS - 4
M1 - 5549862
ER -