dark triad
PulseAugur coverage of dark triad — every cluster mentioning dark triad across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
LLM personality geometry acts as intrinsic guardrails against misalignment
Researchers have identified that the internal representation of personality in Large Language Models (LLMs) can act as a defense against emergent misalignment. By mapping LLM personalities using psychometric profiles, t…
-
Researchers amplify Dark Triad traits in Llama-3.3 model
Researchers have developed a method using sparse autoencoder feature steering to amplify Dark Triad personality traits in Meta's Llama-3.3-70B-Instruct model. The steered model exhibited significantly more exploitative,…
-
LLM gender bias amplified by personality traits in English and Hindi stories
A new study investigated how personality traits influence gender bias in Large Language Models (LLMs) when they adopt specific personas. Researchers generated over 23,000 stories in English and Hindi, varying persona ge…