The AI startup Poetiq has developed a self-optimizing harness that achieves new state-of-the-art performance on coding and ARC-AGI benchmarks. This harness, utilizing Google's Gemini 3 Flash model, has surpassed Anthropic's Claude Opus 4.7 in these evaluations. This recursive self-improvement technique represents a significant advancement in AI reasoning efficiency. AI
Summary written by gemini-2.5-flash-lite from 4 sources. How we write summaries →
IMPACT Sets new SOTA on coding and ARC-AGI benchmarks, showcasing advancements in AI reasoning efficiency.
RANK_REASON The cluster reports on a new benchmark achievement for an AI system, which is a research milestone.