PulseAugur
LIVE 03:21:08
ENTITY MMLU

MMLU

PulseAugur coverage of MMLU — every cluster mentioning MMLU across labs, papers, and developer communities, ranked by signal.

Total · 30d
29
29 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
25
25 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_27573 ·

    New Metacognitive Probe assesses LLM confidence and self-awareness

    Researchers have developed a new diagnostic tool called the Metacognitive Probe to assess how well Large Language Models (LLMs) understand their own confidence levels. This five-task probe decomposes an LLM's confidence…