AgentDojo
PulseAugur coverage of AgentDojo — every cluster mentioning AgentDojo across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
LLM attack benchmarks cover less than 25% of threat landscape
Researchers have developed a new framework to audit the coverage of benchmarks designed to test Large Language Model (LLM) attacks. This framework, based on a taxonomy of over 500 inference-time attacks, reveals that cu…
-
LLM attack benchmarks show significant gaps in security coverage
Researchers have developed a new framework to audit the coverage of LLM attack benchmarks, revealing significant gaps in current evaluations. Their analysis of six public benchmarks showed they collectively cover less t…
-
New attack exploits LLM agent relays, bypassing alignment defenses
Researchers have identified a new vulnerability in LLM agent architectures that use Bring-Your-Own-Key (BYOK) systems. These architectures route LLM traffic through third-party relays, creating an integrity gap where a …