Phi-4
PulseAugur coverage of Phi-4 — every cluster mentioning Phi-4 across labs, papers, and developer communities, ranked by signal.
2 day(s) with sentiment data
-
AI reasoning studies flawed by focus on final answer, not computation
A new research paper identifies a significant flaw in chain-of-thought (CoT) corruption studies, which are used to evaluate the faithfulness of AI reasoning. The study found that these evaluations often mistakenly ident…
-
LLMs show mixed reliability for mental health screening
A new research paper investigates the reliability of large language models (LLMs) for mental health screening, specifically their ability to estimate anxiety and depression scores from speech. The study evaluated three …
-
Autolearn framework enables language models to learn from documents without supervision
Researchers have introduced Autolearn, a novel framework designed to enable language models to learn from documents without external supervision. The system identifies passages that generate unusually high per-token los…