GPT-5.4
PulseAugur coverage of GPT-5.4: every cluster mentioning GPT-5.4 across labs, papers, and developer communities, ranked by signal.
- developed by OpenAI 100%
- subsidiary of OpenAI 100%
- competes with DeepSeek 80%
- competes with Claude Opus 4.6 80%
- competes with Claude Sonnet 4.6 70%
- competes with Claude Opus 4.7 70%
- uses codex 70%
- competes with Kimi K2.6 70%
- competes with DeepSeek V4-Pro 70%
- competes with Claude Sonnet 4.5 70%
- competes with GPT-5.3-Codex 70%
- competes with SWE-bench Pro 70%
8 days with sentiment data
- AI models show dangerous inconsistency in carb counting, study finds
A study involving 26,904 queries across four leading AI models—OpenAI's GPT-5.4, Anthropic's Claude Sonnet 4.6, and two Google Gemini versions—revealed significant inconsistencies in carbohydrate estimations from food i…
- AI models show dangerous variability in carb counting for diabetes apps
A recent study revealed significant inconsistencies in AI models' ability to accurately estimate carbohydrate counts from food images, posing potential health risks for diabetes management. Across over 26,000 queries, m…
- AWS launches Amazon Quick, integrates OpenAI models into Bedrock
Amazon Web Services has launched Amazon Quick, an AI agent designed to integrate with local files, emails, and applications to streamline workflows. The company also announced a deeper partnership with OpenAI, bringing …
- Xiaomi open-sources MiMo-V2.5 AI models, showcasing macOS simulation and high token efficiency
Xiaomi has officially open-sourced its MiMo-V2.5 series of AI models, including the flagship MiMo-V2.5 Pro agent model. These models demonstrate strong performance, rivaling top closed-source models like Claude Opus 4.6…
- Xiaomi's MiMo-v2.5-Pro open-source model rivals top AI coding assistants
Xiaomi has released MiMo-v2.5-Pro, an open-source coding-focused language model that demonstrates impressive capabilities in complex tasks. The model successfully completed a university-level compiler project in hours, …
- Frontier LLMs like GPT-5.4 and Claude Opus 4.7 show significant verbal tics
A new paper analyzes the prevalence of verbal tics, such as repetitive phrases and sycophantic openers, in eight leading large language models. Researchers developed a Verbal Tic Index (VTI) to quantify these tics, find…
- OpenAI's GPT-5.5 shows major gains in usability and cybersecurity
OpenAI has released GPT-5.5, a significant upgrade that improves human-like conversation and coding assistance, according to early user reports. This new model demonstrates enhanced readability and functionality, with s…
- Claude Opus 4.7 leads frontier agents in AI research acceleration benchmark
A new research paper proposes a benchmark to assess AI's ability to autonomously implement machine learning pipelines, aiming to detect early signs of recursive self-improvement. Frontier coding agents were tasked with …
- OpenAI ends Microsoft legal peril over its $50B Amazon deal
OpenAI and Microsoft have renegotiated their partnership, significantly reducing Microsoft's exclusivity over OpenAI's products. OpenAI can now offer its services across all cloud providers, including rivals like Amazon…
- GitHub Copilot shifts to usage-based billing, charging per token from June 1
GitHub is transitioning its Copilot AI service to a usage-based billing model, effective June 1, 2026. This shift moves away from a flat-rate subscription with a set number of requests to a system where users are charge…
- GPT-5.4 and Claude Opus 4.6 fail banking benchmark, scoring 0% client-ready outputs
A new benchmark called BankerToolBench has revealed significant shortcomings in current large language models when applied to financial tasks. GPT-5.4, Claude Opus 4.6, and other models were tested on simulated junior i…
- Claude and ChatGPT face off in programming and business workflows
Claude and ChatGPT are being compared for their effectiveness in programming and business workflows, with Claude showing advantages in long-context tasks and nuanced writing, while ChatGPT excels in multimedia generatio…
- OpenAI doubles GPT-5.5 prices, DeepSeek offers cheaper open models
OpenAI has released GPT-5.5, doubling the price of its API tokens while introducing a 1 million token context window and enhanced capabilities for agents. This move positions GPT-5.5 as a premium, integrated product for…
- Meta plans $25B bond offering as US economy shows mixed signals
DeepSeek has released its V4 model, featuring a 1.6 trillion parameter version and a 1 million token context window, optimized for Huawei's Ascend AI chips. This move marks a significant shift away from Nvidia hardware,…
- Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026
A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…
- DeepSeek releases V4 Pro and Flash models with 1M context, runs on Huawei chips
DeepSeek has released its new V4 family of models, including V4 Pro and V4 Flash, which boast a 1 million token context window. These models were trained on 32 trillion tokens and feature a novel hybrid attention system…
- OpenAI's GPT-5.5 integrates coding, offers new prompting guidance
OpenAI has released GPT-5.5, integrating its coding capabilities directly into the main model and enhancing agentic performance on computer tasks. The company advises users to treat GPT-5.5 as a new model family, recomm…
- DeepSeek releases V4, an open-source model rivaling top closed-source AI
Chinese AI firm DeepSeek has released V4, a new flagship model that offers improved efficiency and longer context windows. The model is open-source and comes in two versions: V4-Pro for complex tasks and V4-Flash for sp…
- OpenAI's GPT-5.5 shows improved reasoning and context handling over GPT-5.4
OpenAI has released GPT-5.5, which demonstrates improved performance and context handling compared to its predecessor, GPT-5.4. Early user impressions suggest GPT-5.5 processes complex tasks more quickly and accurately,…
- AI models show Western bias, homogenizing values across cultures
A new study auditing large language models found that three leading systems—Claude Sonnet 4.5, GPT-5.4, and Gemini 2.5 Flash—consistently provided individualistic advice, even when presented with dilemmas from users in …