New OCR pipeline enhances retail bill digitization with adaptive enhancement

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 2 sources

Researchers have developed and benchmarked an adaptive Optical Character Recognition (OCR) pipeline designed for digitizing retail bills across various commercial sectors. The system incorporates a CNN-based image enhancement module, an image quality analyzer, a feedback loop for iterative retries, and an NLP-based correction layer. Tested on a dataset of 360 retail bills, the pipeline achieved a Character Error Rate (CER) of 18.4% and a Word Error Rate (WER) of 27.6%, significantly outperforming the Raw Tesseract baseline and demonstrating a notable speed advantage over EasyOCR. AI

Summary written by gemini-2.5-flash-lite from 2 sources. How we write summaries →

IMPACT Establishes a new benchmark for OCR in retail, potentially improving data extraction efficiency for businesses.

RANK_REASON Academic paper detailing a new OCR pipeline and its benchmarked performance.

Read on arXiv cs.CV →

paper
other

COVERAGE [2]

arXiv cs.CV TIER_1 · Vijaysinh Gaikwad · 2026-04-29 04:00

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

arXiv:2604.25176v1 Announce Type: new Abstract: The digitization of multi-domain retail billing documents remains a challenging task due to variability in scan quality, layout heterogeneity, and domain diversity across commercial sectors. This paper proposes and benchmarks an int…
arXiv cs.CV TIER_1 · Vijaysinh Gaikwad · 2026-04-28 03:31

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

The digitization of multi-domain retail billing documents remains a challenging task due to variability in scan quality, layout heterogeneity, and domain diversity across commercial sectors. This paper proposes and benchmarks an intelligent, quality-aware adaptive Optical Charact…

COVERAGE [2]

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

RELATED ENTITIES

RELATED TOPICS