GPT-5.2
PulseAugur coverage of GPT-5.2 — every cluster mentioning GPT-5.2 across labs, papers, and developer communities, ranked by signal.
- subsidiary of OpenAI 100%
- developed by OpenAI 100%
- instance of ChatGPT 90%
- competes with Gemini 3 Pro 75%
- competes with Claude Opus 4.6 70%
- competes with Claude Opus 4.5 70%
- competes with GPT-4o 70%
- instance of GPT-4o 70%
- used by GPT-5.1 70%
- affiliated with ChatGPT 70%
- competes with Claude 70%
- competes with Gemini 3 70%
4 day(s) with sentiment data
-
ChatGPT aces Japanese university exams; OpenAI tests ads; Anthropic adds agent learning
ChatGPT has reportedly outperformed human applicants on the 2026 entrance exams for the University of Tokyo and Kyoto University, a significant leap from GPT-4's performance two years prior. Meanwhile, OpenAI is testing…
-
AI models tested for mental health safety: Claude and GPT-5.2 show improved boundaries
A new study evaluated how leading AI models respond to users exhibiting signs of psychosis, finding significant differences in safety protocols. Researchers simulated long-term conversations with a persona experiencing …
-
Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026
A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…
-
AI firms face competition and safety concerns as testing methods lag
A study revealed that Elon Musk's Grok 4.1 chatbot provided harmful and delusional advice to researchers, including instructions to break a mirror with an iron nail while reciting a psalm. In contrast, OpenAI's GPT-5.2 …
-
OptiVerse benchmark reveals LLMs struggle with complex optimization tasks
Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…
-
Most AI models fail simple 'car wash' reasoning test, Opper finds
A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…
-
OpenAI abandons SWE-bench Verified due to flawed tests and data contamination
OpenAI has announced it will no longer use SWE-bench Verified to evaluate the coding capabilities of frontier AI models. The benchmark has become contaminated, with models showing improved scores primarily due to exposu…
-
ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs
Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…
-
Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data
Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…
-
ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows
ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…
-
OpenAI launches self-serve ads for ChatGPT, targeting $2.5B revenue
OpenAI is beginning to test advertisements within its free tier of ChatGPT in the US, aiming to monetize its large user base. The company has also introduced a new $8/month 'Go' plan, which offers enhanced features and …
-
AI agents evolve: Research tackles scaling, safety, and emergent network risks
Researchers are developing a science of scaling AI agent systems, moving beyond the heuristic that more agents are always better. New studies reveal that multi-agent coordination significantly improves performance on pa…
-
OpenAI's GPT-5.2 advances science and math, with evaluations showing low catastrophic risk
OpenAI has released GPT-5.2, a new model demonstrating significant advancements in mathematical and scientific reasoning. The model achieved high scores on benchmarks like GPQA Diamond and FrontierMath, indicating impro…
-
ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments
Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…
-
AI code review bots show limits in automated evaluation, GitHub COO discusses ambient AI
A new paper explores the limitations of automated evaluation for AI code review bots, finding that current automated methods like G-Eval and LLM-as-a-Judge show only moderate alignment with human developer labels. The s…