Retrieval of Mandarin broadcast news using spoken queries

Berlin Chen, Hsin Min Wang, Lin Shan Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Considering the monosyllabic structure of the Chinese language, a whole class of indexing features for retrieval of Mandarin broadcast news using syllable-level statistical characteristics has been previously investigated. This paper presents the improvements achieved over the previous results. The major differences are: (1) Multi-scale character- and word-level indexing terms have been integrated with the syllable-level information. (2) Information cues from the contemporary newswire text corpus have been used to create more accurate syllable indexing terms. (3) Automatic document expansion, blind relevance feedback, and query expansion via the term association matrix have been applied in retrieval. With all these schemes, the average precision can be improved from 55.46% to 71.29%.

Original languageEnglish
Title of host publication6th International Conference on Spoken Language Processing, ICSLP 2000
PublisherInternational Speech Communication Association
ISBN (Electronic)7801501144, 9787801501141
Publication statusPublished - 2000 Jan 1
Event6th International Conference on Spoken Language Processing, ICSLP 2000 - Beijing, China
Duration: 2000 Oct 162000 Oct 20

Publication series

Name6th International Conference on Spoken Language Processing, ICSLP 2000

Other

Other6th International Conference on Spoken Language Processing, ICSLP 2000
CountryChina
CityBeijing
Period00/10/1600/10/20

Fingerprint

indexing
broadcast
news
language
Indexing
News Broadcasts

ASJC Scopus subject areas

  • Linguistics and Language
  • Language and Linguistics

Cite this

Chen, B., Wang, H. M., & Lee, L. S. (2000). Retrieval of Mandarin broadcast news using spoken queries. In 6th International Conference on Spoken Language Processing, ICSLP 2000 (6th International Conference on Spoken Language Processing, ICSLP 2000). International Speech Communication Association.

Retrieval of Mandarin broadcast news using spoken queries. / Chen, Berlin; Wang, Hsin Min; Lee, Lin Shan.

6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, 2000. (6th International Conference on Spoken Language Processing, ICSLP 2000).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, B, Wang, HM & Lee, LS 2000, Retrieval of Mandarin broadcast news using spoken queries. in 6th International Conference on Spoken Language Processing, ICSLP 2000. 6th International Conference on Spoken Language Processing, ICSLP 2000, International Speech Communication Association, 6th International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, 00/10/16.
Chen B, Wang HM, Lee LS. Retrieval of Mandarin broadcast news using spoken queries. In 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association. 2000. (6th International Conference on Spoken Language Processing, ICSLP 2000).
Chen, Berlin ; Wang, Hsin Min ; Lee, Lin Shan. / Retrieval of Mandarin broadcast news using spoken queries. 6th International Conference on Spoken Language Processing, ICSLP 2000. International Speech Communication Association, 2000. (6th International Conference on Spoken Language Processing, ICSLP 2000).
@inproceedings{2f82fb24524849b9a3dae4d27f3e4a93,
title = "Retrieval of Mandarin broadcast news using spoken queries",
abstract = "Considering the monosyllabic structure of the Chinese language, a whole class of indexing features for retrieval of Mandarin broadcast news using syllable-level statistical characteristics has been previously investigated. This paper presents the improvements achieved over the previous results. The major differences are: (1) Multi-scale character- and word-level indexing terms have been integrated with the syllable-level information. (2) Information cues from the contemporary newswire text corpus have been used to create more accurate syllable indexing terms. (3) Automatic document expansion, blind relevance feedback, and query expansion via the term association matrix have been applied in retrieval. With all these schemes, the average precision can be improved from 55.46{\%} to 71.29{\%}.",
author = "Berlin Chen and Wang, {Hsin Min} and Lee, {Lin Shan}",
year = "2000",
month = "1",
day = "1",
language = "English",
series = "6th International Conference on Spoken Language Processing, ICSLP 2000",
publisher = "International Speech Communication Association",
booktitle = "6th International Conference on Spoken Language Processing, ICSLP 2000",

}

TY - GEN

T1 - Retrieval of Mandarin broadcast news using spoken queries

AU - Chen, Berlin

AU - Wang, Hsin Min

AU - Lee, Lin Shan

PY - 2000/1/1

Y1 - 2000/1/1

N2 - Considering the monosyllabic structure of the Chinese language, a whole class of indexing features for retrieval of Mandarin broadcast news using syllable-level statistical characteristics has been previously investigated. This paper presents the improvements achieved over the previous results. The major differences are: (1) Multi-scale character- and word-level indexing terms have been integrated with the syllable-level information. (2) Information cues from the contemporary newswire text corpus have been used to create more accurate syllable indexing terms. (3) Automatic document expansion, blind relevance feedback, and query expansion via the term association matrix have been applied in retrieval. With all these schemes, the average precision can be improved from 55.46% to 71.29%.

AB - Considering the monosyllabic structure of the Chinese language, a whole class of indexing features for retrieval of Mandarin broadcast news using syllable-level statistical characteristics has been previously investigated. This paper presents the improvements achieved over the previous results. The major differences are: (1) Multi-scale character- and word-level indexing terms have been integrated with the syllable-level information. (2) Information cues from the contemporary newswire text corpus have been used to create more accurate syllable indexing terms. (3) Automatic document expansion, blind relevance feedback, and query expansion via the term association matrix have been applied in retrieval. With all these schemes, the average precision can be improved from 55.46% to 71.29%.

UR - http://www.scopus.com/inward/record.url?scp=85008060161&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85008060161&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85008060161

T3 - 6th International Conference on Spoken Language Processing, ICSLP 2000

BT - 6th International Conference on Spoken Language Processing, ICSLP 2000

PB - International Speech Communication Association

ER -