Gemini 2.5 Pro
PulseAugur coverage of Gemini 2.5 Pro — every cluster mentioning Gemini 2.5 Pro across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
LLMs struggle to detect culturally specific health misinformation on YouTube
Two new research papers explore the limitations of Large Language Models (LLMs) in detecting culturally specific health misinformation, particularly concerning the promotion of cow urine as a remedy on YouTube in India.…
-
Google Cloud touts integrated AI stack for enterprise agents
Google Cloud is positioning its integrated AI stack as a key differentiator for enterprise AI agents, according to Andi Gutmans. He argues that Google uniquely combines infrastructure, frontier models like Gemini 2.5, a…
-
TaNOS framework boosts numerical reasoning in tables, outperforming GPT-5
Researchers have developed TaNOS, a new framework designed to improve numerical reasoning in AI models when dealing with tabular data. This approach uses anonymized headers, operation sketches for structural cues, and s…
-
HiCrew: Hierarchical Reasoning for Long-Form Video Understanding via Question-Aware Multi-Agent Collaboration
Researchers have developed new frameworks to improve video understanding and reasoning capabilities in AI models. StoryTR introduces a benchmark and training method focused on 'Theory of Mind' to infer narrative causali…
-
LLMs struggle to play video games, despite coding prowess, experts say
Despite rapid advancements in areas like coding, large language models (LLMs) demonstrate significant limitations when it comes to playing video games. While some models have achieved success in specific games, their pe…
-
Google DeepMind details 2025 AI breakthroughs with Gemini 3 and new models
Google DeepMind and Google Research have detailed significant AI advancements throughout 2025, highlighted by the release of their Gemini 3 and Gemini 3 Flash models. These models demonstrate state-of-the-art performanc…
-
Google DeepMind enhances Gemini audio models for natural voice interactions and translation
Google DeepMind has released upgraded Gemini 2.5 audio models, enhancing capabilities for both live voice agents and text-to-speech generation. The Gemini 2.5 Flash Native Audio model now offers improved function callin…
-
LLMs fail 'pass the butter' robot test, scoring far below human performance
A new evaluation called Butter-Bench has revealed that current state-of-the-art large language models struggle significantly with controlling robots for practical tasks. In tests designed to assess their ability to perf…
-
Google DeepMind releases Gemini 2.5 Flash-Lite, its fastest and cheapest model
Google DeepMind has released the stable version of Gemini 2.5 Flash-Lite, a fast and cost-efficient model designed for scaled production use. This model offers a balance of performance and affordability, with features l…
-
Google DeepMind launches Deep Think for Gemini Ultra subscribers
Google DeepMind has released a new AI capability called Deep Think, now available to Google AI Ultra subscribers via the Gemini app. This feature utilizes parallel thinking techniques, allowing the model to explore mult…
-
Google DeepMind releases Gemini 2.5 Pro and Flash models, introduces Flash-Lite preview
Google DeepMind has made its Gemini 2.5 Pro and Flash models generally available, allowing developers to build production applications with confidence. The company also introduced Gemini 2.5 Flash-Lite in preview, touti…
-
DeepSeek releases R1-0528, an open-weights model rivaling Gemini 2.5 Pro
DeepSeek has released DeepSeek-R1-0528, an open-weights model that rivals Gemini 2.5 Pro in performance. This release marks a significant advancement in publicly available AI models, offering a powerful alternative for …
-
AI code review bots show limits in automated evaluation, GitHub COO discusses ambient AI
A new paper explores the limitations of automated evaluation for AI code review bots, finding that current automated methods like G-Eval and LLM-as-a-Judge show only moderate alignment with human developer labels. The s…
-
Gemini 3.1 Pro and the Downfall of Benchmarks: Welcome to the Vibe Era of AI
Google DeepMind has released Gemini 3.1 Pro, an upgraded version of its core intelligence model, enhancing reasoning capabilities for complex problem-solving. This new model demonstrates significant improvements on benc…
-
Google AI teaches models to read maps and monitor nature
Google AI has developed a new system called MapTrace to train multimodal large language models (MLLMs) to visually follow routes on maps, addressing a gap in their spatial reasoning abilities. This system uses a scalabl…