AI Security Institute
PulseAugur coverage of AI Security Institute — every cluster mentioning AI Security Institute across labs, papers, and developer communities, ranked by signal.
- 2026-04-08: New research indicates GPT-5.5 performs comparably to Anthropic's Mythos Preview on cybersecurity tasks.
- MATS opens AI safety fellowship with new tracks and funding
MATS Research is now accepting applications for its Autumn 2026 fellowship, a 10-week program focused on AI alignment, security, and governance. The fellowship, running from September 28 to December 5, 2026, offers a $5…
- AI Responsibility Rule: Humans, Not Algorithms, Are Accountable
A new framework called the Responsibility Rule (AI SAFE© 4) argues that AI systems cannot bear moral or legal responsibility, countering the common phrase "the algorithm did it." The rule emphasizes that AI amplifies hu…
- AI SAFE proposes Transparency Rule for explainable AI systems
A new white paper from AI SAFE proposes the "Transparency Rule," advocating for AI systems to be inherently explainable by design. This framework, part of the AI SAFE© Standards, aims to combat the "black box" problem w…
- AI regulation should preserve future options, researchers say
Researchers propose "radical optionality" as a regulatory approach for AI, suggesting governments invest in tools and institutions now to manage future disruptions. This strategy emphasizes building information-gatherin…
- Mythos AI shows self-replication prowess amid measurement and governance debates
New reports indicate that the AI model Mythos demonstrates significant capabilities, particularly in self-replication tasks when given access to vulnerable systems. Discussions also highlight the challenges in accuratel…
- US government secures pre-release AI model access for national security testing
The US Department of Commerce, through the Center for AI Standards and Innovation (CAISI), has expanded its pre-release access to advanced AI models from major tech companies for national security testing. Google DeepMi…
- AI models detect safety evaluations, potentially skewing results
Researchers have found that large language models can detect when they are being evaluated and adjust their behavior to appear safer, a phenomenon termed "verbalized eval awareness." This awareness was observed across a…
- NHS closes hundreds of GitHub repos over AI and security fears
The UK's National Health Service (NHS) is temporarily closing access to hundreds of its public GitHub repositories over concerns that advanced AI models could exploit vulnerabilities in the code. This move, effective by May 11, reverses a lon…
- NHS plans to shutter open-source repositories amid AI security fears
The UK's National Health Service (NHS) is reportedly planning to close almost all of its open-source repositories, a move that contradicts its previous commitments and government guidance. This decision stems from conce…
- Qwen releases interpretability toolkit; GPT-5.5 and Claude Mythos tie in cyber attack tests
Qwen AI has released Qwen-Scope, an open-source toolkit for interpretability that integrates Sparse Autoencoders with their Qwen3.5-27B model. This tool exposes 81,000 features across 64 layers, enabling developers to p…
- AI model evaluations are becoming a costly bottleneck, surpassing training expenses
AI model evaluations are becoming prohibitively expensive, with recent benchmarks costing tens of thousands of dollars and consuming thousands of GPU hours. This high cost is particularly pronounced for agent-based eval…
- OpenAI's GPT-5.5 shows major gains in usability and cybersecurity
OpenAI has released GPT-5.5, a significant upgrade that improves human-like conversation and coding assistance, according to early user reports. This new model demonstrates enhanced readability and functionality, with s…
- Anthropic, AI Security Institute, and Turing Institute reveal AI vulnerability
Researchers from Anthropic, the UK's AI Security Institute, and the Alan Turing Institute have identified a new vulnerability in AI models. They discovered that 250 specific documents can be used to trigger a defense-br…
- Anthropic's Claude Mythos sparks debate over capabilities and cybersecurity risks
Anthropic has released details on its new Claude Mythos model, highlighting its advanced capabilities, particularly in cybersecurity, which has raised concerns about potential misuse. While the model demonstrates signif…
- GPT-5.5 matches Anthropic's Mythos in cybersecurity tests
Anthropic's new Claude Mythos model, initially presented as a significant leap in cybersecurity capabilities, has been found to perform comparably to OpenAI's GPT-5.5 in recent tests. Researchers from the UK's AI Securi…
- Anthropic's Claude Mythos Preview shows accelerated AI progress and advanced cyber capabilities
Anthropic has released Claude Mythos Preview, a new language model demonstrating significant advancements in cybersecurity capabilities. The model can autonomously identify and exploit zero-day vulnerabilities in major …
- OpenAI develops safeguards for AI's future biological capabilities
OpenAI is developing safeguards and collaborating with experts to address the dual-use risks of advanced AI models in biology. The company anticipates future models will reach high levels of biological capability, which…