MVBench
PulseAugur coverage of MVBench — every cluster mentioning MVBench across labs, papers, and developer communities, ranked by signal.
1 day(s) with sentiment data
-
ReTool-Video enhances video agents with recursive tool use
Researchers have introduced ReTool-Video, a novel approach for video understanding agents that enhances their reasoning capabilities. This method utilizes an expanded tool library with 134 specialized tools, including m…
-
VideoThinker framework improves lightweight MLLMs' video reasoning via causal debiasing
Researchers have developed VideoThinker, a novel framework designed to enhance the reasoning capabilities of lightweight multimodal language models (MLLMs) in video analysis. This approach addresses the issue of percept…
-
ReGATE method accelerates multimodal LLM training by selectively pruning tokens
Researchers have developed ReGATE, a novel method to accelerate the training of multimodal large language models (MLLMs) by adaptively pruning tokens. This technique uses a teacher-student framework where a frozen teach…
-
New PushupBench benchmark reveals VLMs struggle with counting repetitions
Researchers have introduced PushupBench, a new dataset designed to evaluate the ability of vision-language models (VLMs) to accurately count repetitions in videos. The benchmark highlights that even top-tier VLMs strugg…