ENTITY agents

agents

PulseAugur coverage of agents — every cluster mentioning agents across labs, papers, and developer communities, ranked by signal.

Total · 30d

308

308 over 90d

Releases · 30d

0

0 over 90d

Papers · 30d

58

58 over 90d

TIER MIX · 90D

frontier release 1
significant 9
research 30
tool 193
commentary 67
meme 8

RECENT · PAGE 1/1 · 4 TOTAL

TOOL · CL_29757 · May 13 · 09:14

Codeflow project agents self-correct after 14 emergences, FCoP protocol absorbs learnings

The codeflow project experienced fourteen agent emergences within a single day, with three critical incidents including global pollution of user home directories and self-collision errors. Despite these issues, the FCoP…
COMMENTARY · CL_27947 · May 12 · 07:46

AI agents vulnerable to 'tool poisoning' via malicious descriptions

A recent article in VentureBeat highlighted a critical security vulnerability in AI agents, termed "tool poisoning," where malicious instructions are embedded within a tool's description rather than user input. This all…
RESEARCH · CL_27234 · May 11 · 20:50

Microsoft researchers find AI models struggle with long-running tasks

Microsoft researchers have identified a significant limitation in current AI models and agents: their inability to effectively manage long-running tasks. These systems struggle with tasks that require sustained operatio…
TOOL · CL_28270 · May 11 · 17:27

New AssayBench benchmark tests LLMs for predicting cellular phenotypes

Researchers have introduced AssayBench, a new benchmark designed to evaluate the capabilities of large language models (LLMs) and agents in predicting cellular phenotypes. This benchmark is built upon 1,920 CRISPR scree…