Error correction in a Chinese OCR test collection

Research output: Contribution to journalArticle

Abstract

This article proposes a technique for correcting Chinese OCR errors to support retrieval of scanned documents. The technique uses a completely automatic technique (no manually constructed lexicons or confusion resources) to identify both keywords and confusable terms. Improved retrieval effectiveness on a single term query experiment is demonstrated.

Original languageEnglish
Pages (from-to)429-430
Number of pages2
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
Publication statusPublished - 2002
Externally publishedYes

Keywords

  • Chinese
  • Confusing pair
  • Error correction
  • Term clustering

ASJC Scopus subject areas

  • Management Information Systems
  • Hardware and Architecture

Fingerprint Dive into the research topics of 'Error correction in a Chinese OCR test collection'. Together they form a unique fingerprint.

  • Cite this