A margin-based discriminative modeling approach for extractive speech summarization

Shih Hung Liu, Kuan Yu Chen, Berlin Chen, Ea Ee Jan, Hsin Min Wang, Hsu Chun Yen, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

The task of extractive speech summarization is to select a set of salient sentences from an original spoken document and concatenate them to form a summary, facilitating users to better browse through and understand the content of the document. In this paper we present an empirical study of leveraging various supervised discriminative methods for effectively ranking important sentences of a spoken document to be summarized. In addition, we propose a novel margin-based discriminative training (MBDT) algorithm that aims to penalize non-summary sentences in an inverse proportion to their summarization evaluation scores, leading to better discrimination from the desired summary sentences. By doing so, the summarization model can be trained with an objective function that is closely coupled with the ultimate evaluation metric of extractive speech summarization. Furthermore, sentences of spoken documents are embodied by a wide range of prosodie, lexical and relevance features, whose utilities are extensively compared and analyzed. Experiments conducted on a Mandarin broadcast news summarization task demonstrate the performance merits of our summarization method when compared to several well-studied state-of-the-art supervised and unsupervised methods.

Original languageEnglish
Title of host publication2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9786163618238
DOIs
Publication statusPublished - 2014 Feb 12
Event2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 - Chiang Mai, Thailand
Duration: 2014 Dec 92014 Dec 12

Publication series

Name2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

Other

Other2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014
CountryThailand
CityChiang Mai
Period14/12/914/12/12

Fingerprint

Experiments

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems

Cite this

Liu, S. H., Chen, K. Y., Chen, B., Jan, E. E., Wang, H. M., Yen, H. C., & Hsu, W. L. (2014). A margin-based discriminative modeling approach for extractive speech summarization. In 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014 [7041591] (2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/APSIPA.2014.7041591

A margin-based discriminative modeling approach for extractive speech summarization. / Liu, Shih Hung; Chen, Kuan Yu; Chen, Berlin; Jan, Ea Ee; Wang, Hsin Min; Yen, Hsu Chun; Hsu, Wen Lian.

2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014. Institute of Electrical and Electronics Engineers Inc., 2014. 7041591 (2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Liu, SH, Chen, KY, Chen, B, Jan, EE, Wang, HM, Yen, HC & Hsu, WL 2014, A margin-based discriminative modeling approach for extractive speech summarization. in 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014., 7041591, 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014, Institute of Electrical and Electronics Engineers Inc., 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014, Chiang Mai, Thailand, 14/12/9. https://doi.org/10.1109/APSIPA.2014.7041591
Liu SH, Chen KY, Chen B, Jan EE, Wang HM, Yen HC et al. A margin-based discriminative modeling approach for extractive speech summarization. In 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014. Institute of Electrical and Electronics Engineers Inc. 2014. 7041591. (2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014). https://doi.org/10.1109/APSIPA.2014.7041591
Liu, Shih Hung ; Chen, Kuan Yu ; Chen, Berlin ; Jan, Ea Ee ; Wang, Hsin Min ; Yen, Hsu Chun ; Hsu, Wen Lian. / A margin-based discriminative modeling approach for extractive speech summarization. 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014. Institute of Electrical and Electronics Engineers Inc., 2014. (2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014).
@inproceedings{1d98f7e3fbb74e9ea37445d64f5c6c73,
title = "A margin-based discriminative modeling approach for extractive speech summarization",
abstract = "The task of extractive speech summarization is to select a set of salient sentences from an original spoken document and concatenate them to form a summary, facilitating users to better browse through and understand the content of the document. In this paper we present an empirical study of leveraging various supervised discriminative methods for effectively ranking important sentences of a spoken document to be summarized. In addition, we propose a novel margin-based discriminative training (MBDT) algorithm that aims to penalize non-summary sentences in an inverse proportion to their summarization evaluation scores, leading to better discrimination from the desired summary sentences. By doing so, the summarization model can be trained with an objective function that is closely coupled with the ultimate evaluation metric of extractive speech summarization. Furthermore, sentences of spoken documents are embodied by a wide range of prosodie, lexical and relevance features, whose utilities are extensively compared and analyzed. Experiments conducted on a Mandarin broadcast news summarization task demonstrate the performance merits of our summarization method when compared to several well-studied state-of-the-art supervised and unsupervised methods.",
author = "Liu, {Shih Hung} and Chen, {Kuan Yu} and Berlin Chen and Jan, {Ea Ee} and Wang, {Hsin Min} and Yen, {Hsu Chun} and Hsu, {Wen Lian}",
year = "2014",
month = "2",
day = "12",
doi = "10.1109/APSIPA.2014.7041591",
language = "English",
series = "2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
booktitle = "2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014",

}

TY - GEN

T1 - A margin-based discriminative modeling approach for extractive speech summarization

AU - Liu, Shih Hung

AU - Chen, Kuan Yu

AU - Chen, Berlin

AU - Jan, Ea Ee

AU - Wang, Hsin Min

AU - Yen, Hsu Chun

AU - Hsu, Wen Lian

PY - 2014/2/12

Y1 - 2014/2/12

N2 - The task of extractive speech summarization is to select a set of salient sentences from an original spoken document and concatenate them to form a summary, facilitating users to better browse through and understand the content of the document. In this paper we present an empirical study of leveraging various supervised discriminative methods for effectively ranking important sentences of a spoken document to be summarized. In addition, we propose a novel margin-based discriminative training (MBDT) algorithm that aims to penalize non-summary sentences in an inverse proportion to their summarization evaluation scores, leading to better discrimination from the desired summary sentences. By doing so, the summarization model can be trained with an objective function that is closely coupled with the ultimate evaluation metric of extractive speech summarization. Furthermore, sentences of spoken documents are embodied by a wide range of prosodie, lexical and relevance features, whose utilities are extensively compared and analyzed. Experiments conducted on a Mandarin broadcast news summarization task demonstrate the performance merits of our summarization method when compared to several well-studied state-of-the-art supervised and unsupervised methods.

AB - The task of extractive speech summarization is to select a set of salient sentences from an original spoken document and concatenate them to form a summary, facilitating users to better browse through and understand the content of the document. In this paper we present an empirical study of leveraging various supervised discriminative methods for effectively ranking important sentences of a spoken document to be summarized. In addition, we propose a novel margin-based discriminative training (MBDT) algorithm that aims to penalize non-summary sentences in an inverse proportion to their summarization evaluation scores, leading to better discrimination from the desired summary sentences. By doing so, the summarization model can be trained with an objective function that is closely coupled with the ultimate evaluation metric of extractive speech summarization. Furthermore, sentences of spoken documents are embodied by a wide range of prosodie, lexical and relevance features, whose utilities are extensively compared and analyzed. Experiments conducted on a Mandarin broadcast news summarization task demonstrate the performance merits of our summarization method when compared to several well-studied state-of-the-art supervised and unsupervised methods.

UR - http://www.scopus.com/inward/record.url?scp=84949925578&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84949925578&partnerID=8YFLogxK

U2 - 10.1109/APSIPA.2014.7041591

DO - 10.1109/APSIPA.2014.7041591

M3 - Conference contribution

AN - SCOPUS:84949925578

T3 - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

BT - 2014 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA 2014

PB - Institute of Electrical and Electronics Engineers Inc.

ER -