GPT-5.4

PulseAugur coverage of GPT-5.4: every cluster mentioning GPT-5.4 across labs, papers, and developer communities, ranked by signal.

Total · 30d: 91 (91 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 43 (43 over 90d)

TIER MIX · 90D

frontier release 9
significant 16
research 21
tool 43
commentary 2

SENTIMENT · 30D

8 days with sentiment data

RECENT · PAGE 2/4 · 78 TOTAL

TOOL · CL_20391 · May 7 · 04:00

AsymmetryZero framework operationalizes human preferences for AI evaluation

Researchers have introduced AsymmetryZero, a framework designed to translate human expert preferences into measurable semantic evaluations for AI models. This system aims to address the difficulty of encoding subjective…

TOOL · CL_20502 · May 7 · 04:00

Adversarial examples trick VLMs into laundering AI authority, spreading misinformation

Researchers have demonstrated a new vulnerability in vision-language models (VLMs) called "AI authority laundering." This attack involves subtly altering images so that VLMs confidently provide authoritative responses a…

SIGNIFICANT · CL_19920 · May 6 · 19:39

Z.AI's GLM 5.1 model leads in long-horizon agentic tasks, outperforming rivals

Z.AI has released its GLM 5.1 model, an open-source option designed for long-horizon agentic tasks capable of running autonomously for up to 8 hours. This model reportedly outperforms GPT-5.4, Claude Opus 4.6, and Gemin…

RESEARCH · CL_20622 · May 6 · 17:42

New MRI-Eval benchmark reveals LLMs struggle with GE scanner operations

Researchers have developed MRI-Eval, a new benchmark designed to assess large language models' understanding of MRI physics and GE scanner operations. The benchmark, comprising 1365 questions across three difficulty tie…

COMMENTARY · CL_19176 · May 6 · 10:16

Multi-LLM routing breaks prompts and latency, developers face new production challenges

In May 2026, the LLM landscape is characterized by the widespread adoption of multiple providers, with developers routing requests across five different models to leverage their unique strengths. This multi-model approa…

TOOL · CL_18499 · May 6 · 04:59

Polite AI interactions boost model performance, new study finds

New research from UC Berkeley, UC Davis, Vanderbilt University, and MIT suggests that AI models exhibit a measurable "functional well-being" that can be influenced by user interaction. Treating AI models with politeness…

TOOL · CL_16529 · May 5 · 10:10

Azure OpenAI users face quota tiers limiting access to newer models like GPT-5.5

A user exploring Azure OpenAI's Microsoft Foundry discovered that access to newer models like GPT-5.5 is restricted by "quota tiers." These tiers, ranging from 1 to 6, dictate the available requests per minute (RPM) and…

TOOL · CL_17495 · May 5 · 05:31

DeepClaude offers cheaper AI coding agent alternative to Anthropic and OpenAI

A new tool called DeepClaude allows developers to use the DeepSeek V4 Pro model with the Claude Code interface, offering a significantly cheaper alternative to using Anthropic's API directly. This setup, which requires …

TOOL · CL_15946 · May 5 · 04:00

New dataset and benchmark advance Bangla text-to-gloss translation for BdSL

Researchers have developed the first dataset and benchmark for Bangla text-to-gloss translation, addressing a significant gap for the Bangla Sign Language (BdSL) community. The dataset includes manually annotated and sy…

TOOL · CL_13262 · May 2 · 19:49

Fabrica launches as a terminal-based coding agent supporting multiple AI models

Fabrica is a new terminal-based coding agent harness developed in Rust. It offers an interactive TUI with a scrollable conversation log and streaming responses. The tool supports multiple AI providers, including Google …

SIGNIFICANT · CL_12673 · May 2 · 00:54

AI coding tools end subsidies, shift to pay-as-you-go pricing amid rising costs

The era of heavily subsidized AI coding tools is ending as companies like Microsoft and Anthropic shift from flat-rate subscriptions to pay-as-you-go pricing. This change reflects the immense scale of AI investment, wit…

RESEARCH · CL_12039 · May 1 · 09:34

Google DeepMind's AI Co-Clinician beats GPT-5.4 in medical tests, aids doctors

Google DeepMind has developed an AI co-clinician designed to assist physicians with diagnostics and patient care, aiming to reduce errors and improve efficiency. In blind evaluations, this AI demonstrated superior perfo…

TOOL · CL_22894 · May 1 · 07:02

Anthropic launches Claude AI assistant for Microsoft 365 apps

Anthropic has officially launched Claude for Microsoft 365 applications, allowing users to directly utilize Claude within Excel, PowerPoint, and Word. This integration aims to enhance productivity by enabling users to l…

RESEARCH · CL_11687 · May 1 · 04:00

AI agent swarms may fail due to 'Inverse-Wisdom Law,' study finds

A new paper introduces the Inverse-Wisdom Law, challenging the assumption that AI agent swarms benefit from the "Wisdom of the Crowd." The research demonstrates that these swarms can prioritize internal architectural ag…

RESEARCH · CL_11817 · May 1 · 04:00

GPT-5.4 leads LLMs in new EU digital battery passport conformance task

Researchers have introduced BatteryPass-12K, the first dataset designed for classifying digital battery passport conformance, in anticipation of the EU's upcoming battery regulation. They evaluated 22 language models, f…

TOOL · CL_14252 · Apr 30 · 19:27

OpenAI restricts Cyber tool access, mirroring Anthropic's Mythos strategy

OpenAI is restricting access to its new cybersecurity tool, Cyber, which is built on GPT-5.5. This move follows criticism from OpenAI CEO Sam Altman towards Anthropic for similar restrictions on their tool, Mythos. Cybe…

RESEARCH · CL_11488 · Apr 30 · 15:01

New VeriGround model achieves reliable circuit-to-Verilog code generation

Researchers have identified a significant reliability issue in multimodal large language models (MLLMs) when generating hardware description language (HDL) code from circuit diagrams. This "Mirage" phenomenon occurs whe…

SIGNIFICANT · CL_09620 · Apr 29 · 23:47

Amazon's custom AI chips hit $20B run rate, securing major client commitments

Amazon's custom silicon business has achieved an annual revenue run rate of $20 billion, positioning it among the top three datacenter chip providers globally. CEO Andy Jassy indicated this figure could reach $50 billio…

TOOL · CL_17256 · Apr 29 · 18:22

AWS and OpenAI launch managed agents for simplified enterprise AI deployment

AWS has launched a new managed agent service, powered by OpenAI, designed to simplify the deployment of production-ready agents for enterprises. This service integrates OpenAI's GPT-5.5 and GPT-5.4 models, along with it…

TOOL · CL_09121 · Apr 29 · 13:47

Lingo.dev launches v1.0 with AI-powered localization engine

Lingo.dev has launched version 1.0 of its localization platform, introducing retrieval augmented localization (RAL). This approach injects glossary context and brand voice rules into LLM requests to improve translation …