A user on Mastodon shared thoughts on Opus 4.7, noting that while many perceive a performance decline compared to Opus 4.6, their analysis of offline and online evaluations suggests overall improvement. The user also raised questions about whether unquantifiable aspects like 'personality' might be contributing to the perceived differences. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
IMPACT User-provided analysis suggests potential discrepancies between perceived and evaluated performance of AI models.
RANK_REASON User opinion on model performance differences.