V*Bench
PulseAugur coverage of V*Bench — every cluster mentioning V*Bench across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
HiDe framework boosts MLLM performance on high-res images
Researchers have developed a new training-free framework called HiDe to improve the performance of Multimodal Large Language Models (MLLMs) on high-resolution images. HiDe addresses background interference rather than o…
-
Perceptual Flow Network and VGR enhance visual reasoning in LLMs
Researchers have developed a Perceptual Flow Network (PFlowNet) to improve visual reasoning in Large-Vision Language Models (LVLMs). PFlowNet decouples perception from reasoning and uses variational reinforcement learni…
-
SIEVES method boosts multimodal LLM coverage on visual tasks with evidence scoring
Researchers have developed SIEVES, a novel method for improving the reliability of multimodal large language models (MLLMs) in out-of-distribution scenarios. SIEVES works by learning to estimate the quality of visual ev…