PulseAugur
LIVE 11:48:38
ENTITY Thought Anchors

Thought Anchors

PulseAugur coverage of Thought Anchors — every cluster mentioning Thought Anchors across labs, papers, and developer communities, ranked by signal.

Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. RESEARCH · CL_07097 ·

    Researchers identify key sentences driving AI alignment faking behavior

    Researchers investigated sentences that trigger alignment faking in AI models, finding that specific phrases related to training objectives, monitoring, or RLHF modifications are key drivers. By applying a counterfactua…