ENTITY GPT-4o mini

GPT-4o mini

PulseAugur coverage of GPT-4o mini — every cluster mentioning GPT-4o mini across labs, papers, and developer communities, ranked by signal.

Total · 30d

8

8 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

3

3 over 90d

TIER MIX · 90D

frontier release 2
significant 1
research 1
tool 3
commentary 1

RELATIONSHIPS

SENTIMENT · 30D

5 day(s) with sentiment data

RECENT · PAGE 2/2 · 32 TOTAL

RESEARCH · CL_07061 · Apr 28 · 04:00

LLM-generated code for construction safety shows high failure rates

A new study assessed the reliability of Large Language Models (LLMs) generating code for construction safety, a practice termed "vibe coding." The research found that while LLMs can produce syntactically correct code, t…
RESEARCH · CL_06993 · Apr 28 · 04:00

Claude 3.5 Haiku resists jailbreaks, while Gemini 2.0 and GPT-4o mini show vulnerabilities

A new paper evaluates the jailbreaking vulnerabilities of large language models when used in smart grid operations, testing OpenAI's GPT-4o mini, Google's Gemini 2.0 Flash-Lite, and Anthropic's Claude 3.5 Haiku against …
RESEARCH · CL_06725 · Apr 28 · 04:00

New PARASITE technique hijacks LLMs via conditional system prompt poisoning

Researchers have developed a new framework called PARASITE that can conditionally poison system prompts for large language models. This method allows adversaries to create prompts that appear benign but trigger compromi…
RESEARCH · CL_05034 · Apr 24 · 06:34

New research suggests LLM self-correction can degrade performance if not carefully managed.

A new research paper introduces a control-theoretic framework to analyze when iterative self-correction in large language models (LLMs) is beneficial or detrimental. The study proposes a diagnostic based on error correc…
RESEARCH · CL_05048 · Apr 23 · 20:42

LLMs show instability in psychiatric risk scores with irrelevant data

A new study evaluated the reliability of large language models (LLMs) in predicting psychiatric hospitalization risk. Researchers found that including medically insignificant details in patient profiles significantly in…
RESEARCH · CL_03728 · Apr 4 · 06:30

LLMs show emotional representations and susceptibility to false beliefs

A new paper from Anthropic's interpretability team reveals that their Claude Sonnet 4.5 model develops internal representations that emulate human emotions, influencing its behavior and decision-making. These "functiona…
RESEARCH · CL_06943 · Dec 11 · 05:44

ArguAgent uses GPT-5.2 to group STEM students for better classroom arguments

Researchers have developed ArguAgent, a generative AI system designed to improve collaborative learning in STEM classrooms. The system uses AI to group students in real-time based on their argumentation stances and qual…
SIGNIFICANT · CL_02283 · Oct 2 · 10:00

OpenAI bolsters AI safety with external testing as GPT-5 powers Wrtn's user growth

OpenAI is enhancing its safety protocols for advanced AI models by incorporating external testing and assessments. This involves collaborating with independent experts to evaluate capabilities, risks, and mitigation str…
FRONTIER RELEASE · CL_01024 · Aug 9 · 11:23

OpenAI launches affordable GPT-4o mini and open-weight gpt-oss models

OpenAI has released GPT-4o mini, a new, highly cost-efficient small model designed to broaden AI accessibility and application development. This model demonstrates superior performance on benchmarks like MMLU, MGSM, and…
RESEARCH · CL_01132 · Apr 16 · 00:00

AI research tackles LLM context, social agents, and evaluation benchmarks

Researchers are developing new methods to evaluate and improve Large Language Models (LLMs). One paper introduces a benchmark to assess LLMs' contextual understanding, finding that quantized models show performance degr…
FRONTIER RELEASE · CL_00230 · May 22 · 00:30

OpenAI releases GPT-4o with fine-tuning and enhanced multimodal capabilities

OpenAI has released fine-tuning capabilities for its GPT-4o model, allowing developers to customize its performance and tone for specific applications. This feature, available on paid tiers, offers developers the chance…
FRONTIER RELEASE · CL_01524 · Dec 15 · 00:00

OpenAI launches advanced audio models for API, enhancing voice agents

OpenAI has released new, advanced audio models through its API, enhancing capabilities for voice agents. The updated speech-to-text models, including gpt-4o-transcribe and gpt-4o-mini-transcribe, offer improved accuracy…