Artificial Analysis
PulseAugur coverage of Artificial Analysis — every cluster mentioning Artificial Analysis across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
Luma Agents gain multimodal support; new coding agent benchmark released
Artificial Analysis has released the Coding Agent Index, a benchmark evaluating AI coding agents across three key benchmarks, considering token usage and cost to aid in selection. Separately, Luma Labs has enhanced its …
-
Alibaba's Happy Horse-1.0 video model aims for cinematic storytelling
Alibaba's Happy Horse-1.0 video generation model has entered a closed beta, aiming to advance beyond basic visual output to cinematic storytelling. Early tests show promise in maintaining character consistency across mu…
-
GPT-5.5 edges out Claude Opus on intelligence benchmark
A recent analysis by Artificial Analysis indicates that GPT-5.5 has surpassed Claude Opus by three points on their intelligence benchmark. This benchmark evaluates models across categories like agents, coding, general k…
-
Artificial Analysis offers MiniMax-M2.7 with SambaNovaAI leading inference speed
Artificial Analysis has made its MiniMax-M2.7 model available through six different inference providers, highlighting significant differences in speed and cost. SambaNovaAI leads in performance, achieving 435 tokens per…
-
AI advancements span robot manufacturing, dev tools, and cost-effective models
A discussion is emerging around the potential for integrated model handoff stacks to serve as new Integrated Development Environments (IDEs), particularly for multimodal workflows involving image, vision, and 3D models.…
-
X launches Grok 4.3 with improved agentic performance and lower price
xAI has released Grok-4.3, a new iteration of its AI model, which offers improved agentic performance and a lower price point compared to its predecessor. The model achieved a significant increase of 321 ELO points on t…
-
Alibaba's HappyHorse 1.0 leads AI video generation, with new GPT models and tools emerging
Alibaba has released HappyHorse 1.0, an open-source AI video generator capable of producing 1080p videos with synchronized audio. This model, powered by a 15 billion parameter transformer, is being touted as the top AI …
-
Google DeepMind launches autonomous research agents powered by Gemini 3.1 Pro
Google DeepMind has launched two new autonomous research agents, Deep Research and Deep Research Max, powered by Gemini 3.1 Pro. These agents are designed to securely analyze user-provided or third-party data, with Deep…