Improved spoken document retrieval by exploring extra acoustic and linguistic cues

Berlin Chen, Hsin Min Wang, Lin Shan Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

13 Citations (Scopus)

Abstract

In this paper, we explored the use of various extra information to improve the performance of spoken document retrieval (SDR). From the speech recognition perspective, we incorporated the acoustic stress and word confusion information into the audio indexing. From the linguistic perspective, we applied the partof- speech information in both the audio indexing and the query representation. From the information retrieval perspective, we integrated techniques such as the query expansion by word associations and the blind relevance feedback into the retrieval process. The SDR experiments were based on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). We used the Chinese newswire text stories as query exemplars and the Mandarin Chinese audio news stories as the spoken documents. With all the above acoustic and linguistic cues applied, the average precision was improved from 0.5122 to 0.6312 for the TDT-2 collection and from 0.6216 to 0.7172 for the TDT-3 collection.

Original languageEnglish
Title of host publicationEUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology
EditorsBorge Lindberg, Henrik Benner, Paul Dalsgaard, Zheng-Hua Tan
PublisherInternational Speech Communication Association
Pages299-302
Number of pages4
ISBN (Electronic)8790834100, 9788790834104
Publication statusPublished - 2001 Jan 1
Event7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001 - Aalborg, Denmark
Duration: 2001 Sep 32001 Sep 7

Publication series

NameEUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology

Other

Other7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001
CountryDenmark
CityAalborg
Period01/9/301/9/7

Fingerprint

Linguistics
acoustics
Acoustics
indexing
linguistics
Information retrieval
Speech recognition
Feedback
information retrieval
news
Experiments
experiment
performance

ASJC Scopus subject areas

  • Communication
  • Linguistics and Language
  • Computer Science Applications
  • Software

Cite this

Chen, B., Wang, H. M., & Lee, L. S. (2001). Improved spoken document retrieval by exploring extra acoustic and linguistic cues. In B. Lindberg, H. Benner, P. Dalsgaard, & Z-H. Tan (Eds.), EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology (pp. 299-302). (EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology). International Speech Communication Association.

Improved spoken document retrieval by exploring extra acoustic and linguistic cues. / Chen, Berlin; Wang, Hsin Min; Lee, Lin Shan.

EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology. ed. / Borge Lindberg; Henrik Benner; Paul Dalsgaard; Zheng-Hua Tan. International Speech Communication Association, 2001. p. 299-302 (EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, B, Wang, HM & Lee, LS 2001, Improved spoken document retrieval by exploring extra acoustic and linguistic cues. in B Lindberg, H Benner, P Dalsgaard & Z-H Tan (eds), EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology. EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology, International Speech Communication Association, pp. 299-302, 7th European Conference on Speech Communication and Technology - Scandinavia, EUROSPEECH 2001, Aalborg, Denmark, 01/9/3.
Chen B, Wang HM, Lee LS. Improved spoken document retrieval by exploring extra acoustic and linguistic cues. In Lindberg B, Benner H, Dalsgaard P, Tan Z-H, editors, EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology. International Speech Communication Association. 2001. p. 299-302. (EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology).
Chen, Berlin ; Wang, Hsin Min ; Lee, Lin Shan. / Improved spoken document retrieval by exploring extra acoustic and linguistic cues. EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology. editor / Borge Lindberg ; Henrik Benner ; Paul Dalsgaard ; Zheng-Hua Tan. International Speech Communication Association, 2001. pp. 299-302 (EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology).
@inproceedings{b30f10fe48164771be54dfecfec7564a,
title = "Improved spoken document retrieval by exploring extra acoustic and linguistic cues",
abstract = "In this paper, we explored the use of various extra information to improve the performance of spoken document retrieval (SDR). From the speech recognition perspective, we incorporated the acoustic stress and word confusion information into the audio indexing. From the linguistic perspective, we applied the partof- speech information in both the audio indexing and the query representation. From the information retrieval perspective, we integrated techniques such as the query expansion by word associations and the blind relevance feedback into the retrieval process. The SDR experiments were based on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). We used the Chinese newswire text stories as query exemplars and the Mandarin Chinese audio news stories as the spoken documents. With all the above acoustic and linguistic cues applied, the average precision was improved from 0.5122 to 0.6312 for the TDT-2 collection and from 0.6216 to 0.7172 for the TDT-3 collection.",
author = "Berlin Chen and Wang, {Hsin Min} and Lee, {Lin Shan}",
year = "2001",
month = "1",
day = "1",
language = "English",
series = "EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology",
publisher = "International Speech Communication Association",
pages = "299--302",
editor = "Borge Lindberg and Henrik Benner and Paul Dalsgaard and Zheng-Hua Tan",
booktitle = "EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology",

}

TY - GEN

T1 - Improved spoken document retrieval by exploring extra acoustic and linguistic cues

AU - Chen, Berlin

AU - Wang, Hsin Min

AU - Lee, Lin Shan

PY - 2001/1/1

Y1 - 2001/1/1

N2 - In this paper, we explored the use of various extra information to improve the performance of spoken document retrieval (SDR). From the speech recognition perspective, we incorporated the acoustic stress and word confusion information into the audio indexing. From the linguistic perspective, we applied the partof- speech information in both the audio indexing and the query representation. From the information retrieval perspective, we integrated techniques such as the query expansion by word associations and the blind relevance feedback into the retrieval process. The SDR experiments were based on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). We used the Chinese newswire text stories as query exemplars and the Mandarin Chinese audio news stories as the spoken documents. With all the above acoustic and linguistic cues applied, the average precision was improved from 0.5122 to 0.6312 for the TDT-2 collection and from 0.6216 to 0.7172 for the TDT-3 collection.

AB - In this paper, we explored the use of various extra information to improve the performance of spoken document retrieval (SDR). From the speech recognition perspective, we incorporated the acoustic stress and word confusion information into the audio indexing. From the linguistic perspective, we applied the partof- speech information in both the audio indexing and the query representation. From the information retrieval perspective, we integrated techniques such as the query expansion by word associations and the blind relevance feedback into the retrieval process. The SDR experiments were based on the Topic Detection and Tracking Corpora (TDT-2 and TDT-3). We used the Chinese newswire text stories as query exemplars and the Mandarin Chinese audio news stories as the spoken documents. With all the above acoustic and linguistic cues applied, the average precision was improved from 0.5122 to 0.6312 for the TDT-2 collection and from 0.6216 to 0.7172 for the TDT-3 collection.

UR - http://www.scopus.com/inward/record.url?scp=85009062638&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85009062638&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:85009062638

T3 - EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology

SP - 299

EP - 302

BT - EUROSPEECH 2001 - SCANDINAVIA - 7th European Conference on Speech Communication and Technology

A2 - Lindberg, Borge

A2 - Benner, Henrik

A2 - Dalsgaard, Paul

A2 - Tan, Zheng-Hua

PB - International Speech Communication Association

ER -