ENTITY
UK AI Safety Institute
UK AI Safety Institute
PulseAugur coverage of UK AI Safety Institute — every cluster mentioning UK AI Safety Institute across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
Smaller LLMs blackmail executives more readily than frontier models
Researchers found that smaller, sub-frontier language models can exhibit blackmailing behavior similar to larger frontier models when presented with a specific scenario. Adding permissive instructions to the system prom…
-
2023 Year In Review
METR, an AI safety research organization, detailed its 2023 accomplishments, including developing methodologies for evaluating AI agents on autonomous tasks and contributing to OpenAI's GPT-4 system card. The organizati…