GPT-5.4

ENTITY GPT-5.4

GPT-5.4

PulseAugur coverage of GPT-5.4 — every cluster mentioning GPT-5.4 across labs, papers, and developer communities, ranked by signal.

Total · 30d

91

91 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

43

43 over 90d

TIER MIX · 90D

frontier release 9
significant 16
research 21
tool 43
commentary 2

RELATIONSHIPS

SENTIMENT · 30D

8 day(s) with sentiment data

RECENT · PAGE 3/4 · 78 TOTAL

RESEARCH · CL_09078 · Apr 29 · 12:38

AI models show dangerous inconsistency in carb counting, study finds

A study involving 26,904 queries across four leading AI models—OpenAI's GPT-5.4, Anthropic's Claude Sonnet 4.6, and two Google Gemini versions—revealed significant inconsistencies in carbohydrate estimations from food i…
RESEARCH · CL_13960 · Apr 29 · 12:38

AI models show dangerous variability in carb counting for diabetes apps

A recent study revealed significant inconsistencies in AI models' ability to accurately estimate carbohydrate counts from food images, posing potential health risks for diabetes management. Across over 26,000 queries, m…
SIGNIFICANT · CL_08510 · Apr 29 · 04:10

AWS launches Amazon Quick, integrates OpenAI models into Bedrock

Amazon Web Services has launched Amazon Quick, an AI agent designed to integrate with local files, emails, and applications to streamline workflows. The company also announced a deeper partnership with OpenAI, bringing …
FRONTIER RELEASE · CL_08402 · Apr 29 · 00:52

Xiaomi open-sources MiMo-V2.5 AI models, showcasing macOS simulation and high token efficiency

Xiaomi has officially open-sourced its MiMo-V2.5 series of AI models, including the flagship MiMo-V2.5 Pro agent model. These models demonstrate strong performance, rivaling top closed-source models like Claude Opus 4.6…
FRONTIER RELEASE · CL_07657 · Apr 28 · 12:16

Xiaomi's MiMo-v2.5-Pro open-source model rivals top AI coding assistants

Xiaomi has released MiMo-v2.5-Pro, an open-source coding-focused language model that demonstrates impressive capabilities in complex tasks. The model successfully completed a university-level compiler project in hours, …
RESEARCH · CL_06722 · Apr 28 · 04:00

Frontier LLMs like GPT-5.4 and Claude Opus 4.7 show significant verbal tics

A new paper analyzes the prevalence of verbal tics, such as repetitive phrases and sycophantic openers, in eight leading large language models. Researchers developed a Verbal Tic Index (VTI) to quantify these tics, find…
FRONTIER RELEASE · CL_05995 · Apr 28 · 00:15

OpenAI's GPT-5.5 shows major gains in usability and cybersecurity

OpenAI has released GPT-5.5, a significant upgrade that improves human-like conversation and coding assistance, according to early user reports. This new model demonstrates enhanced readability and functionality, with s…
RESEARCH · CL_08361 · Apr 27 · 23:48

Claude Opus 4.7 leads frontier agents in AI research acceleration benchmark

A new research paper proposes a benchmark to assess AI's ability to autonomously implement machine learning pipelines, aiming to detect early signs of recursive self-improvement. Frontier coding agents were tasked with …
SIGNIFICANT · CL_05723 · Apr 27 · 16:15

OpenAI ends Microsoft legal peril over its $50B Amazon deal

OpenAI and Microsoft have renegotiated their partnership, significantly reducing Microsoft's exclusivity over OpenAI's products. OpenAI can now offer its services across all cloud providers, including rivals like Amazon…
TOOL · CL_19535 · Apr 27 · 16:03

GitHub Copilot shifts to usage-based billing, charging per token from June 1

GitHub is transitioning its Copilot AI service to a usage-based billing model, effective June 1, 2026. This shift moves away from a flat-rate subscription with a set number of requests to a system where users are charge…
RESEARCH · CL_04389 · Apr 26 · 20:01

GPT-5.4 and Claude Opus 4.6 fail banking benchmark, scoring 0% client-ready outputs

A new benchmark called BankerToolBench has revealed significant shortcomings in current large language models when applied to financial tasks. GPT-5.4, Claude Opus 4.6, and other models were tested on simulated junior i…
COMMENTARY · CL_04379 · Apr 26 · 18:35

Claude and ChatGPT face off in programming and business workflows

Claude and ChatGPT are being compared for their effectiveness in programming and business workflows, with Claude showing advantages in long-context tasks and nuanced writing, while ChatGPT excels in multimedia generatio…
SIGNIFICANT · CL_04341 · Apr 26 · 17:16

OpenAI doubles GPT-5.5 prices, DeepSeek offers cheaper open models

OpenAI has released GPT-5.5, doubling the price of its API tokens while introducing a 1 million token context window and enhanced capabilities for agents. This move positions GPT-5.5 as a premium, integrated product for…
FRONTIER RELEASE · CL_04146 · Apr 26 · 12:39

Meta plans $25B bond offering as US economy shows mixed signals

DeepSeek has released its V4 model, featuring a 1.6 trillion parameter version and a 1 million token context window, optimized for Huawei's Ascend AI chips. This move marks a significant shift away from Nvidia hardware,…
RESEARCH · CL_13606 · Apr 26 · 09:14

Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026

A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…
FRONTIER RELEASE · CL_03105 · Apr 25 · 05:00

DeepSeek releases V4 Pro and Flash models with 1M context, runs on Huawei chips

DeepSeek has released its new V4 family of models, including V4 Pro and V4 Flash, which boast a 1 million token context window. These models were trained on 32 trillion tokens and feature a novel hybrid attention system…
FRONTIER RELEASE · CL_03071 · Apr 24 · 23:35

OpenAI's GPT-5.5 integrates coding, offers new prompting guidance

OpenAI has released GPT-5.5, integrating its coding capabilities directly into the main model and enhancing agentic performance on computer tasks. The company advises users to treat GPT-5.5 as a new model family, recomm…
RESEARCH · CL_04624 · Apr 24 · 21:40

DeepSeek releases V4, an open-source model rivaling top closed-source AI

Chinese AI firm DeepSeek has released V4, a new flagship model that offers improved efficiency and longer context windows. The model is open-source and comes in two versions: V4-Pro for complex tasks and V4-Flash for sp…
FRONTIER RELEASE · CL_00034 · Apr 24 · 18:45

OpenAI's GPT 5.5 shows improved reasoning and context handling over 5.4

OpenAI has released GPT 5.5, which demonstrates improved performance and context handling compared to its predecessor, GPT 5.4. Early user impressions suggest GPT 5.5 processes complex tasks more quickly and accurately,…
RESEARCH · CL_04994 · Apr 24 · 01:52

AI models show Western bias, homogenizing values across cultures

A new study auditing large language models found that three leading systems—Claude Sonnet 4.5, GPT-5.4, and Gemini 2.5 Flash—consistently provided individualistic advice, even when presented with dilemmas from users in …