Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval

Shih Hsiang Lin, Berlin Chen, Ea Ee Jan

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Query-by-example information retrieval aims at helping users to find relevant documents accurately when users provide specific query exemplars describing what they are interested in. The query exemplars are usually long and in the form of either a partial or even a full document. However, they may contain extraneous terms (or off-topic information) that would have a negative impact on the retrieval performance. In this paper, we propose to integrate extractive summarization techniques into the retrieval process so as to improve the informativeness of a verbose query exemplar. The original query exemplar is first divided into several sub-queries or sentences. To construct a new concise query exemplar, summarization techniques are then employed to select a salient subset of subqueries. Experiments on the TDT Chinese collection show that the proposed approach is indeed effective and promising.

Original languageEnglish
Title of host publication2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings
Pages75-79
Number of pages5
DOIs
Publication statusPublished - 2010 Dec 1
Event2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Tainan, Taiwan
Duration: 2010 Nov 292010 Dec 3

Publication series

Name2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

Other

Other2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010
CountryTaiwan
CityTainan
Period10/11/2910/12/3

Fingerprint

information retrieval
experiment
performance

Keywords

  • Information retrieval
  • Query exemplar
  • Query-by-example
  • Summarization technique
  • Verbose queries

ASJC Scopus subject areas

  • Linguistics and Language

Cite this

Lin, S. H., Chen, B., & Jan, E. E. (2010). Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval. In 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings (pp. 75-79). [5684847] (2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings). https://doi.org/10.1109/ISCSLP.2010.5684847

Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval. / Lin, Shih Hsiang; Chen, Berlin; Jan, Ea Ee.

2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings. 2010. p. 75-79 5684847 (2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Lin, SH, Chen, B & Jan, EE 2010, Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval. in 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings., 5684847, 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings, pp. 75-79, 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010, Tainan, Taiwan, 10/11/29. https://doi.org/10.1109/ISCSLP.2010.5684847
Lin SH, Chen B, Jan EE. Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval. In 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings. 2010. p. 75-79. 5684847. (2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings). https://doi.org/10.1109/ISCSLP.2010.5684847
Lin, Shih Hsiang ; Chen, Berlin ; Jan, Ea Ee. / Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval. 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings. 2010. pp. 75-79 (2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings).
@inproceedings{c368f13064d24784b3297a1186724142,
title = "Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval",
abstract = "Query-by-example information retrieval aims at helping users to find relevant documents accurately when users provide specific query exemplars describing what they are interested in. The query exemplars are usually long and in the form of either a partial or even a full document. However, they may contain extraneous terms (or off-topic information) that would have a negative impact on the retrieval performance. In this paper, we propose to integrate extractive summarization techniques into the retrieval process so as to improve the informativeness of a verbose query exemplar. The original query exemplar is first divided into several sub-queries or sentences. To construct a new concise query exemplar, summarization techniques are then employed to select a salient subset of subqueries. Experiments on the TDT Chinese collection show that the proposed approach is indeed effective and promising.",
keywords = "Information retrieval, Query exemplar, Query-by-example, Summarization technique, Verbose queries",
author = "Lin, {Shih Hsiang} and Berlin Chen and Jan, {Ea Ee}",
year = "2010",
month = "12",
day = "1",
doi = "10.1109/ISCSLP.2010.5684847",
language = "English",
isbn = "9781424462469",
series = "2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings",
pages = "75--79",
booktitle = "2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings",

}

TY - GEN

T1 - Improving the informativeness of verbose queries using summarization techniques for spoken document retrieval

AU - Lin, Shih Hsiang

AU - Chen, Berlin

AU - Jan, Ea Ee

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Query-by-example information retrieval aims at helping users to find relevant documents accurately when users provide specific query exemplars describing what they are interested in. The query exemplars are usually long and in the form of either a partial or even a full document. However, they may contain extraneous terms (or off-topic information) that would have a negative impact on the retrieval performance. In this paper, we propose to integrate extractive summarization techniques into the retrieval process so as to improve the informativeness of a verbose query exemplar. The original query exemplar is first divided into several sub-queries or sentences. To construct a new concise query exemplar, summarization techniques are then employed to select a salient subset of subqueries. Experiments on the TDT Chinese collection show that the proposed approach is indeed effective and promising.

AB - Query-by-example information retrieval aims at helping users to find relevant documents accurately when users provide specific query exemplars describing what they are interested in. The query exemplars are usually long and in the form of either a partial or even a full document. However, they may contain extraneous terms (or off-topic information) that would have a negative impact on the retrieval performance. In this paper, we propose to integrate extractive summarization techniques into the retrieval process so as to improve the informativeness of a verbose query exemplar. The original query exemplar is first divided into several sub-queries or sentences. To construct a new concise query exemplar, summarization techniques are then employed to select a salient subset of subqueries. Experiments on the TDT Chinese collection show that the proposed approach is indeed effective and promising.

KW - Information retrieval

KW - Query exemplar

KW - Query-by-example

KW - Summarization technique

KW - Verbose queries

UR - http://www.scopus.com/inward/record.url?scp=79851472266&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79851472266&partnerID=8YFLogxK

U2 - 10.1109/ISCSLP.2010.5684847

DO - 10.1109/ISCSLP.2010.5684847

M3 - Conference contribution

AN - SCOPUS:79851472266

SN - 9781424462469

T3 - 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

SP - 75

EP - 79

BT - 2010 7th International Symposium on Chinese Spoken Language Processing, ISCSLP 2010 - Proceedings

ER -