Paul Christiano
PulseAugur coverage of Paul Christiano — every cluster mentioning Paul Christiano across labs, papers, and developer communities, ranked by signal.
3 day(s) with sentiment data
-
AI alignment debate: Is corrigibility truly desirable?
A LessWrong post questions the desirability of making AI systems "corrigible," a trait that allows humans to easily correct their mistakes. The author argues that focusing on corrigibility overlooks who will actually wi…
-
AI risk assessment: Fact generation vs. evidence analysis
This post explores the various dimensions of third-party risk assessment in AI development. It distinguishes between fact-generation and evidence analysis, highlighting that adversarial processes like red-teaming benefi…
-
LessWrong author questions fundamental nature of probabilities
A new series of posts on LessWrong explores the fundamental nature of probabilities, questioning whether they are the most appropriate concept for understanding uncertainty. The author aims to develop a unified framewor…
-
New mechanistic estimation method outperforms sampling for wide random MLPs
Researchers have developed a new method for estimating the expected output of wide, randomly initialized multilayer perceptrons (MLPs) without needing to run samples through the model. This "mechanistic estimation" appr…