Error correction in a Chinese OCR test collection

Yuen Hsien Tseng*

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

This article proposes a technique for correcting Chinese OCR errors to support retrieval of scanned documents. The technique uses a completely automatic technique (no manually constructed lexicons or confusion resources) to identify both keywords and confusable terms. Improved retrieval effectiveness on a single term query experiment is demonstrated.

Original languageEnglish
Pages (from-to)429-430
Number of pages2
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
DOIs
Publication statusPublished - 2002
Externally publishedYes
EventProceedings of the Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - Tampere, Finland
Duration: 2002 Aug 112002 Aug 15

Keywords

  • Chinese
  • Confusing pair
  • Error correction
  • Term clustering

ASJC Scopus subject areas

  • Management Information Systems
  • Hardware and Architecture

Fingerprint

Dive into the research topics of 'Error correction in a Chinese OCR test collection'. Together they form a unique fingerprint.

Cite this