PulseAugur

MMLU-Hard

PulseAugur coverage of MMLU-Hard — every cluster mentioning MMLU-Hard across labs, papers, and developer communities, ranked by signal.

Total · 30d: 1 (1 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 1 (1 over 90d)
RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_18587

    Homogeneous multi-agent debate is less effective than self-correction

    A new research paper, "The Cost of Consensus," finds that homogeneous multi-agent debate among LLMs is less effective and more costly than isolated self-correction. The study, using models such as Qwen2.5-7B and Llama-3.…