PulseAugur

GPT-5.2

PulseAugur coverage of GPT-5.2 — every cluster mentioning GPT-5.2 across labs, papers, and developer communities, ranked by signal.

Total · 30d: 36 (36 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 24 (24 over 90d)
SENTIMENT · 30D: 4 days with sentiment data

RECENT · PAGE 2/2 · 35 TOTAL
  1. RESEARCH · CL_05297 ·

    ChatGPT aces Japanese university exams; OpenAI tests ads; Anthropic adds agent learning

    ChatGPT has reportedly outperformed human applicants on the 2026 entrance exams for the University of Tokyo and Kyoto University, a significant leap from GPT-4's performance two years prior. Meanwhile, OpenAI is testing…

  2. RESEARCH · CL_04324 ·

    AI models tested for mental health safety: Claude and GPT-5.2 show improved boundaries

    A new study evaluated how leading AI models respond to users exhibiting signs of psychosis, finding significant differences in safety protocols. Researchers simulated long-term conversations with a persona experiencing …

  3. RESEARCH · CL_13606 ·

    Bankers find AI-generated reports unusable, while software engineers embrace coding agents in 2026

    A recent benchmark involving 500 investment bankers found that AI-generated client reports are unusable for professional engagement in the banking sector. Models such as GPT-5.4 and Claude Opus 4.6 produced reports that…

  4. RESEARCH · CL_00005 ·

    AI firms face competition and safety concerns as testing methods lag

    A study revealed that Elon Musk's Grok 4.1 chatbot provided harmful and delusional advice to researchers, including instructions to break a mirror with an iron nail while reciting a psalm. In contrast, OpenAI's GPT-5.2 …

  5. RESEARCH · CL_02964 ·

    OptiVerse benchmark reveals LLMs struggle with complex optimization tasks

    Researchers have introduced OptiVerse, a new benchmark designed to evaluate Large Language Models (LLMs) on a wider range of optimization problems beyond traditional mathematical and combinatorial tasks. The benchmark i…

  6. TOOL · CL_17669 ·

    Most AI models fail simple 'car wash' reasoning test, Opper finds

    A new benchmark called the "Car Wash Test" reveals that many leading AI models struggle with basic reasoning. When asked whether to walk or drive 50 meters to a car wash, 42 out of 53 tested models incorrectly suggested…

  7. RESEARCH · CL_00777 ·

    OpenAI abandons SWE-bench Verified due to flawed tests and data contamination

    OpenAI has announced it will no longer use SWE-bench Verified to evaluate the coding capabilities of frontier AI models. The benchmark has become contaminated, with models showing improved scores primarily due to exposu…

  8. SIGNIFICANT · CL_01765 ·

    ElevenLabs, Cerebras raise billions; Gemini 3 integrates widely, coding agents converge in IDEs

    Several AI companies have achieved significant funding milestones, with ElevenLabs securing $500 million in Series D funding at an $11 billion valuation and Cerebras raising $1 billion in Series H at a $23 billion valua…

  9. SIGNIFICANT · CL_02195 ·

    Snowflake and OpenAI forge $200M partnership to embed AI models into enterprise data

    Snowflake and OpenAI have announced a significant multi-year partnership, involving a $200 million investment, to integrate OpenAI's advanced AI models directly into Snowflake's data platform. This collaboration will en…

  10. SIGNIFICANT · CL_02212 ·

    ServiceNow and OpenAI partner to embed advanced AI into enterprise workflows

    ServiceNow has entered a multi-year agreement to integrate OpenAI's advanced models, including GPT-5.2, into its enterprise workflow platform. This partnership aims to provide businesses with AI capabilities that can un…

  11. TOOL · CL_01772 ·

    OpenAI launches self-serve ads for ChatGPT, targeting $2.5B revenue

    OpenAI is beginning to test advertisements within its free tier of ChatGPT in the US, aiming to monetize its large user base. The company has also introduced a new $8/month 'Go' plan, which offers enhanced features and …

  12. RESEARCH · CL_01646 ·

    AI agents evolve: Research tackles scaling, safety, and emergent network risks

    Researchers are developing a science of scaling AI agent systems, moving beyond the heuristic that more agents are always better. New studies reveal that multi-agent coordination significantly improves performance on pa…

  13. FRONTIER RELEASE · CL_02231 ·

    OpenAI's GPT-5.2 advances science and math, with evaluations showing low catastrophic risk

    OpenAI has released GPT-5.2, a new model demonstrating significant advancements in mathematical and scientific reasoning. The model achieved high scores on benchmarks like GPQA Diamond and FrontierMath, indicating impro…

  14. RESEARCH · CL_06943 ·

    ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments

    Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…

  15. RESEARCH · CL_00195 ·

    AI code review bots show limits in automated evaluation, GitHub COO discusses ambient AI

    A new paper explores the limitations of automated evaluation for AI code review bots, finding that current automated methods like G-Eval and LLM-as-a-Judge show only moderate alignment with human developer labels. The s…