A new method called Xanther Context Engine (XCE) has enabled the MiniMax M2.5 model to achieve a 78.2% score on the SWE-bench Verified benchmark, outperforming all other models. This achievement is notable because MiniMax M2.5 is a low-cost model, costing only $0.02 per call, and the performance gains are attributed to improved contextual understanding rather than a more powerful underlying model. The XCE provides AI coding agents with architectural context, significantly enhancing their ability to fix bugs in complex codebases. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT Enhances AI coding agent performance on complex tasks by providing architectural context, potentially lowering costs for software development.
RANK_REASON The cluster describes a new method and benchmark results for AI coding agents, not a release from a frontier lab. [lever_c_demoted from research: ic=1 ai=1.0]