Abstract
This article proposes a technique for correcting Chinese OCR errors to support retrieval of scanned documents. The technique uses a completely automatic technique (no manually constructed lexicons or confusion resources) to identify both keywords and confusable terms. Improved retrieval effectiveness on a single term query experiment is demonstrated.
Original language | English |
---|---|
Pages (from-to) | 429-430 |
Number of pages | 2 |
Journal | SIGIR Forum (ACM Special Interest Group on Information Retrieval) |
DOIs | |
Publication status | Published - 2002 |
Externally published | Yes |
Event | Proceedings of the Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - Tampere, Finland Duration: 2002 Aug 11 → 2002 Aug 15 |
Keywords
- Chinese
- Confusing pair
- Error correction
- Term clustering
ASJC Scopus subject areas
- Management Information Systems
- Hardware and Architecture