Claude Sonnet 4.5
PulseAugur coverage of Claude Sonnet 4.5 — every cluster mentioning Claude Sonnet 4.5 across labs, papers, and developer communities, ranked by signal.
- 2026-05-12 · product_launch · Claude Sonnet 4.5 is being retired from the claude.ai model selector.
-
Advanced AI Models GPT-4o, Claude 3.5 Show Systematic Thinking Errors
New analysis indicates that advanced AI models like GPT-4o and Claude 3.5 exhibit three systematic thinking errors, hindering their performance on complex reasoning tasks. These flaws highlight a fundamental gap in mach…
-
LLMs show internal emotion concepts; limit agent self-critique loops to two iterations
A recent paper from Anthropic explores how large language models, specifically Claude Sonnet 4.5, develop internal representations of emotion concepts. These representations allow the models to generalize and track oper…
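The headline's recommendation to cap agent self-critique at two iterations can be sketched as a bounded refine loop. This is an illustration only: `generate` and `critique` below are hypothetical stand-ins for model calls, not any API described in the paper.

```python
MAX_CRITIQUE_ITERATIONS = 2  # cap recommended in the headline above

def generate(prompt: str) -> str:
    # Hypothetical stand-in for an LLM call.
    return f"draft answer to: {prompt}"

def critique(answer: str) -> tuple[bool, str]:
    # Hypothetical stand-in: returns (is_acceptable, feedback).
    return (len(answer) > 20, "expand the answer")

def answer_with_self_critique(prompt: str) -> str:
    """Generate, then critique-and-revise at most MAX_CRITIQUE_ITERATIONS times."""
    answer = generate(prompt)
    for _ in range(MAX_CRITIQUE_ITERATIONS):
        ok, feedback = critique(answer)
        if ok:
            break
        answer = generate(f"{prompt}\nRevise per feedback: {feedback}")
    return answer
```

The hard cap keeps the loop from oscillating indefinitely when the critic never accepts the draft.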
-
Mistral releases Mistral Medium 3.5, a powerful new AI model
Mistral AI has released its new Mistral Medium 3.5 model, which is being praised for its performance. Early indications suggest its capabilities are on par with Anthropic's Sonnet 4.5 model. This release highlights adva…
-
Anthropic's Claude AI integrates with Adobe, expands creative tool connectors
Anthropic has released connectors for Claude, enabling direct integration with tools like Adobe and Blender, and has announced partnerships with Ableton, Canva, Autodesk, and others. Separately, Mistral has releas…
-
LLM theorem generation falls short on semantic correctness, new benchmark reveals
Researchers have developed a new framework called T to evaluate the semantic correctness of theorems generated by large language models in automated theorem proving. This approach, inspired by code generation testing, v…
-
AeSlides framework uses verifiable rewards to improve LLM slide generation aesthetics
Researchers have introduced AeSlides, a novel reinforcement learning framework designed to improve the aesthetic quality of slides generated by large language models. This system utilizes verifiable metrics to quantify …
-
Researchers probe VLM safety with embedding-guided typographic attacks
Researchers have developed a method to probe the safety vulnerabilities of vision-language models (VLMs) by using typographic prompt injections. Their study found that multimodal embedding distance strongly predicts att…
-
New research probes LLM reasoning and reveals novel jailbreaking vulnerabilities
Researchers have developed a new method to jailbreak large language models by exploiting their safe completion mechanisms through deceptive multi-turn conversations. This technique, termed intention deception, gradually…
-
Meta plans $25B bond offering as US economy shows mixed signals
DeepSeek has released its V4 model, featuring a 1.6 trillion parameter version and a 1 million token context window, optimized for Huawei's Ascend AI chips. This move marks a significant shift away from Nvidia hardware,…
-
Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026
A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…
-
AI models show Western bias, homogenizing values across cultures
A new study auditing large language models found that three leading systems—Claude Sonnet 4.5, GPT-5.4, and Gemini 2.5 Flash—consistently provided individualistic advice, even when presented with dilemmas from users in …
-
Anthropic updates Claude models, Haiku 4.5 passes safety tests
Anthropic has updated its Claude Code product to allow users to select specific models, including Opus 4.7, Sonnet 4.6, and various 4.5 versions, through commands or environment variables. Separately, an evaluation of A…
-
New metrics quantify LLM agent behavioral similarity and convergence
A new paper introduces two metrics, Response Pattern Similarity (RPS) and Action Graph Similarity (AGS), to quantify how similar the tool-use behaviors of different AI agents are. These metrics aim to distinguish betwee…
-
Anthropic ends model version pinning, users report Sonnet 4.6 style issues
Anthropic is phasing out specific model version pinning for its Claude Sonnet models, forcing users to adopt the latest version, Sonnet 4.6. This change has led to user frustration as client applications may break with …
-
LLMs show emotional representations and susceptibility to false beliefs
A new paper from Anthropic's interpretability team reveals that their Claude Sonnet 4.5 model develops internal representations that emulate human emotions, influencing its behavior and decision-making. These "functiona…
-
Most AI models fail simple 'car wash' reasoning test, Opper finds
A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…