GPT-5.4

ENTITY GPT-5.4

PulseAugur coverage of GPT-5.4 — every cluster mentioning GPT-5.4 across labs, papers, and developer communities, ranked by signal.

Total · 30d

91

91 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

43

43 over 90d

TIER MIX · 90D

frontier release 9
significant 16
research 21
tool 43
commentary 2

SENTIMENT · 30D

8 days with sentiment data

RECENT · PAGE 4/4 · 78 TOTAL

RESEARCH · CL_02960 · Apr 23 · 12:36

Process Supervision via Verbal Critique Improves Reasoning in Large Language Models

Researchers have developed a new framework called Verbal Process Supervision (VPS) that enhances the reasoning capabilities of large language models without requiring gradient updates. This method utilizes structured na…
RESEARCH · CL_02975 · Apr 23 · 07:02

AI models evaluated on meeting summaries, GPT-5.1 shows gains

Researchers have developed a reusable pipeline for evaluating AI-generated meeting summaries, designed to be adaptable across different domains. The system treats both ground truth and AI outputs as structured artifacts…
RESEARCH · CL_02999 · Apr 22 · 22:56

AI system enhances science classroom discourse analysis using multi-task learning

Researchers have developed an automated discourse analysis system (ADAS) to classify teacher and student utterances in science classrooms, aiming to understand knowledge construction and improve teaching. The system use…
FRONTIER RELEASE · CL_03443 · Apr 21 · 00:00

Moonshot AI's Kimi K2.6 tops benchmarks, Bezos eyes $10B AI fundraise

Moonshot AI has released Kimi K2.6, a model claiming superior performance on coding and agentic benchmarks, surpassing models like GPT-5.4 and Claude Opus 4.6. Alibaba's Qwen3.6-Max-Preview also shows improved instructi…
SIGNIFICANT · CL_02679 · Apr 18 · 06:50

[AINews] The Two Sides of OpenClaw

Anthropic has launched Claude Design, a new experimental tool that generates visual prototypes, slides, and one-pagers from natural language prompts. This release, powered by Claude Opus 4.7, positions Anthropic as a co…
SIGNIFICANT · CL_00035 · Apr 17 · 18:07

Anthropic's Claude Opus 4.7 faces user backlash over performance decline and security concerns

Anthropic has released Claude Opus 4.7, which shows mixed performance compared to competitors like Gemini and GPT-5.4. Users report significant frustration, claiming the model has been "nerfed" and exhibits decreased re…
RESEARCH · CL_17282 · Apr 17 · 15:47

OpenAI releases GPT-5.4-Cyber for cybersecurity, contrasting with Anthropic's limited Claude Mythos

OpenAI has released GPT-5.4-Cyber, a specialized version of its GPT-5.4 model, aimed at enhancing cybersecurity defenses. This model, available through OpenAI's Trusted Access for Cyber program, offers capabilities like…
RESEARCH · CL_17452 · Apr 17 · 14:09

Public AI models replicate Anthropic's vulnerability research findings

Vidoc Security has replicated findings from Anthropic's Mythos project using publicly available models like GPT-5.4 and Claude Opus 4.6. Their research indicates that advanced AI capabilities for identifying software vu…
SIGNIFICANT · CL_02143 · Apr 13 · 06:00

OpenAI powers enterprise AI adoption with Cloudflare and Hyatt integrations

OpenAI has partnered with Hyatt to integrate ChatGPT Enterprise across the hospitality company's global operations. This collaboration aims to enhance employee productivity by automating manual tasks, allowing staff to …
FRONTIER RELEASE · CL_11191 · Apr 8 · 16:00

RT Artificial Analysis: Meta is back! Muse Spark scores 52 on the Artificial Analysis Intelligence Index, behind only Gemini 3.1 Pro, GPT-5.4, and Cla...

Meta AI has released Muse Spark, a new frontier-class multimodal model developed by Meta Superintelligence Labs. This marks Meta's return to the frontier AI race after a period of relative quiet and is their first model…
RESEARCH · CL_03449 · Apr 8 · 00:00

Anthropic's Claude Mythos finds zero-days; GLM-5.1 targets long tasks

Anthropic's Claude Mythos Preview has demonstrated a significant capability in identifying zero-day vulnerabilities in critical software, leading to the formation of Project Glasswing to enhance cybersecurity. Meanwhile…
TOOL · CL_19489 · Mar 19 · 16:01

Canary launches AI QA tool that outperforms GPT-5.4 and Claude Code on code verification

Canary, a new AI-powered QA tool, has launched to automate testing for pull requests by understanding codebases and generating end-to-end tests for user workflows. The tool aims to catch regressions before code merges, …
RESEARCH · CL_01004 · Mar 18 · 13:02

OpenAI's GPT 5.4 shows significant improvements for agent tasks, rivaling Claude

The author finds OpenAI's GPT 5.4, particularly within the Codex agent, to be a significant improvement for complex, multi-step tasks. Unlike previous iterations that often failed on operations like git commands, GPT 5.…
FRONTIER RELEASE · CL_02626 · Jan 28 · 09:51

OpenAI releases GPT-5.4 mini/nano, Mistral open-sources Small 4, Anthropic updates Claude

OpenAI has released GPT-5.4 mini and nano models with significantly larger context windows and higher per-token prices, while also exploring ads within ChatGPT and implementing age prediction for enhanced safety. Mistra…
FRONTIER RELEASE · CL_04126 · Nov 19 · 05:44

OpenAI launches GPT-5.5, enhancing agentic AI and integrating Codex for enterprise

OpenAI has launched GPT-5.5, a new frontier model that demonstrates significant improvements in agentic capabilities and efficiency. This release integrates enhanced coding abilities through Codex, alongside broader com…
RESEARCH · CL_00834 · Nov 1 · 15:31

In the Arena: How LMSys changed LLM Benchmarking Forever

The AraGen benchmark, developed by Hugging Face, aims to improve LLM evaluation by addressing limitations of static benchmarks. It introduces a crowdsourced approach similar to LMSys's Chatbot Arena, allowing for more d…
SIGNIFICANT · CL_00451 · Feb 20 · 17:11

DeepSeek's cheaper AI models challenge compute-centric US labs, while AI safety debates continue.

DeepSeek's new V4 model offers capabilities comparable to GPT-5.4 but at a significantly lower cost, highlighting a strategic shift towards inference efficiency. This development is particularly relevant for Chinese AI …
FRONTIER RELEASE · CL_00980 · Nov 5 · 08:00

OpenAI launches GPT-5.5, boosting AI intelligence and speed for complex tasks

OpenAI has released GPT-5.5 and GPT-5.5 Pro, their latest and most intuitive models, designed for complex tasks and agentic capabilities. These models excel in areas like coding, data analysis, and operating software, o…