OmniDocBench
PulseAugur coverage of OmniDocBench — every cluster mentioning OmniDocBench across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
New method enhances VLM document layout understanding
Researchers have developed a new method to improve how Vision-Language Models (VLMs) understand document layouts, particularly for documents with structures not seen during training. The approach pre-resolves layout inf…
-
New PureDocBench benchmark reveals document parsing is far from solved
Researchers have introduced PureDocBench, a new benchmark for document parsing that addresses issues with the existing OmniDocBench dataset, which suffers from annotation errors and potential contamination. PureDocBench…
-
RTPrune boosts DeepSeek-OCR inference speed by 1.23x with novel token pruning
Researchers have developed RTPrune, a novel two-stage token pruning method designed to enhance the efficiency of DeepSeek-OCR inference. This method mimics the model's two-stage reading process, first prioritizing high-…