A risk-aware modeling framework for speech summarization

Berlin Chen, Shih Hsiang Lin

Research output: Contribution to journalArticle

13 Citations (Scopus)

Abstract

Extractive speech summarization attempts to select a representative set of sentences from a spoken document so as to succinctly describe the main theme of the original document. In this paper, we adapt the notion of risk minimization for extractive speech summarization by formulating the selection of summary sentences as a decision-making problem. To this end, we develop several selection strategies and modeling paradigms that can leverage supervised and unsupervised summarization models to inherit their individual merits as well as to overcome their inherent limitations. On top of that, various component models are introduced, providing a principled way to render the redundancy and coherence relationships among sentences and between sentences and the whole document, respectively. A series of experiments on speech summarization seem to demonstrate that the methods deduced from our summarization framework are very competitive with existing summarization methods.

Original languageEnglish
Article number5876303
Pages (from-to)199-210
Number of pages12
JournalIEEE Transactions on Audio, Speech and Language Processing
Volume20
Issue number1
DOIs
Publication statusPublished - 2012 Jan 1

Fingerprint

sentences
Redundancy
decision making
redundancy
Decision making
optimization
Experiments

Keywords

  • Decision-making
  • language modeling
  • loss functions
  • risk minimization
  • speech summarization

ASJC Scopus subject areas

  • Acoustics and Ultrasonics
  • Electrical and Electronic Engineering

Cite this

A risk-aware modeling framework for speech summarization. / Chen, Berlin; Lin, Shih Hsiang.

In: IEEE Transactions on Audio, Speech and Language Processing, Vol. 20, No. 1, 5876303, 01.01.2012, p. 199-210.

Research output: Contribution to journalArticle

@article{8c4c1f6544664f9bb76c29e0531b0c33,
title = "A risk-aware modeling framework for speech summarization",
abstract = "Extractive speech summarization attempts to select a representative set of sentences from a spoken document so as to succinctly describe the main theme of the original document. In this paper, we adapt the notion of risk minimization for extractive speech summarization by formulating the selection of summary sentences as a decision-making problem. To this end, we develop several selection strategies and modeling paradigms that can leverage supervised and unsupervised summarization models to inherit their individual merits as well as to overcome their inherent limitations. On top of that, various component models are introduced, providing a principled way to render the redundancy and coherence relationships among sentences and between sentences and the whole document, respectively. A series of experiments on speech summarization seem to demonstrate that the methods deduced from our summarization framework are very competitive with existing summarization methods.",
keywords = "Decision-making, language modeling, loss functions, risk minimization, speech summarization",
author = "Berlin Chen and Lin, {Shih Hsiang}",
year = "2012",
month = "1",
day = "1",
doi = "10.1109/TASL.2011.2159596",
language = "English",
volume = "20",
pages = "199--210",
journal = "IEEE Transactions on Audio, Speech and Language Processing",
issn = "1558-7916",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
number = "1",

}

TY - JOUR

T1 - A risk-aware modeling framework for speech summarization

AU - Chen, Berlin

AU - Lin, Shih Hsiang

PY - 2012/1/1

Y1 - 2012/1/1

N2 - Extractive speech summarization attempts to select a representative set of sentences from a spoken document so as to succinctly describe the main theme of the original document. In this paper, we adapt the notion of risk minimization for extractive speech summarization by formulating the selection of summary sentences as a decision-making problem. To this end, we develop several selection strategies and modeling paradigms that can leverage supervised and unsupervised summarization models to inherit their individual merits as well as to overcome their inherent limitations. On top of that, various component models are introduced, providing a principled way to render the redundancy and coherence relationships among sentences and between sentences and the whole document, respectively. A series of experiments on speech summarization seem to demonstrate that the methods deduced from our summarization framework are very competitive with existing summarization methods.

AB - Extractive speech summarization attempts to select a representative set of sentences from a spoken document so as to succinctly describe the main theme of the original document. In this paper, we adapt the notion of risk minimization for extractive speech summarization by formulating the selection of summary sentences as a decision-making problem. To this end, we develop several selection strategies and modeling paradigms that can leverage supervised and unsupervised summarization models to inherit their individual merits as well as to overcome their inherent limitations. On top of that, various component models are introduced, providing a principled way to render the redundancy and coherence relationships among sentences and between sentences and the whole document, respectively. A series of experiments on speech summarization seem to demonstrate that the methods deduced from our summarization framework are very competitive with existing summarization methods.

KW - Decision-making

KW - language modeling

KW - loss functions

KW - risk minimization

KW - speech summarization

UR - http://www.scopus.com/inward/record.url?scp=81155132373&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=81155132373&partnerID=8YFLogxK

U2 - 10.1109/TASL.2011.2159596

DO - 10.1109/TASL.2011.2159596

M3 - Article

AN - SCOPUS:81155132373

VL - 20

SP - 199

EP - 210

JO - IEEE Transactions on Audio, Speech and Language Processing

JF - IEEE Transactions on Audio, Speech and Language Processing

SN - 1558-7916

IS - 1

M1 - 5876303

ER -