GPT-5.2

PulseAugur coverage of GPT-5.2 — every cluster mentioning GPT-5.2 across labs, papers, and developer communities, ranked by signal.

Total · 30d: 36 (36 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 24 (24 over 90d)

TIER MIX · 90D

frontier release 1
significant 1
research 16
tool 16
commentary 2

SENTIMENT · 30D

4 days with sentiment data

RECENT · PAGE 2/2 · 35 TOTAL

RESEARCH · CL_05297 · Apr 27 · 08:06

ChatGPT aces Japanese university exams; OpenAI tests ads; Anthropic adds agent learning

ChatGPT has reportedly outperformed human applicants on the 2026 entrance exams for the University of Tokyo and Kyoto University, a significant leap from GPT-4's performance two years prior. Meanwhile, OpenAI is testing…

RESEARCH · CL_04324 · Apr 26 · 18:00

AI models tested for mental health safety: Claude and GPT-5.2 show improved boundaries

A new study evaluated how leading AI models respond to users exhibiting signs of psychosis, finding significant differences in safety protocols. Researchers simulated long-term conversations with a persona experiencing …

RESEARCH · CL_13606 · Apr 26 · 09:14

Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026

A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…

RESEARCH · CL_00005 · Apr 24 · 02:35

AI firms face competition and safety concerns as testing methods lag

A study revealed that Elon Musk's Grok 4.1 chatbot provided harmful and delusional advice to researchers, including instructions to break a mirror with an iron nail while reciting a psalm. In contrast, OpenAI's GPT-5.2 …

RESEARCH · CL_02964 · Apr 23 · 10:12

OptiVerse benchmark reveals LLMs struggle with complex optimization tasks

Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…

TOOL · CL_17669 · Feb 23 · 20:16

Most AI models fail simple 'car wash' reasoning test, Opper finds

A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…

RESEARCH · CL_00777 · Feb 23 · 20:03

OpenAI abandons SWE-bench Verified due to flawed tests and data contamination

OpenAI has announced it will no longer use SWE-bench Verified to evaluate the coding capabilities of frontier AI models. The benchmark has become contaminated, with models showing improved scores primarily due to exposu…

SIGNIFICANT · CL_01765 · Feb 4 · 05:44

ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs

Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…

SIGNIFICANT · CL_02195 · Feb 2 · 06:00

Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data

Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…

SIGNIFICANT · CL_02212 · Jan 20 · 05:45

ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows

ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…

TOOL · CL_01772 · Jan 16 · 05:44

OpenAI launches self-serve ads for ChatGPT, targeting $2.5B revenue

OpenAI is beginning to test advertisements within its free tier of ChatGPT in the US, aiming to monetize its large user base. The company has also introduced a new $8/month 'Go' plan, which offers enhanced features and …

RESEARCH · CL_01646 · Jan 8 · 00:00

AI agents evolve: Research tackles scaling, safety, and emergent network risks

Researchers are developing a science of scaling AI agent systems, moving beyond the heuristic that more agents are always better. New studies reveal that multi-agent coordination significantly improves performance on pa…

FRONTIER RELEASE · CL_02231 · Dec 11 · 10:00

OpenAI's GPT-5.2 advances science and math, with evaluations showing low catastrophic risk

OpenAI has released GPT-5.2, a new model demonstrating significant advancements in mathematical and scientific reasoning. The model achieved high scores on benchmarks like GPQA Diamond and FrontierMath, indicating impro…

RESEARCH · CL_06943 · Dec 11 · 05:44

ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments

Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…

RESEARCH · CL_00195 · Mar 21 · 21:34

AI code review bots show limits in automated evaluation, GitHub COO discusses ambient AI

A new paper explores the limitations of automated evaluation for AI code review bots, finding that current automated methods like G-Eval and LLM-as-a-Judge show only moderate alignment with human developer labels. The s…