ENTITY
GSM-Hard
PulseAugur coverage of GSM-Hard — every cluster mentioning GSM-Hard across labs, papers, and developer communities, ranked by signal.
Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
RECENT · PAGE 1/1 · 2 TOTAL
- Homogeneous multi-agent debate is less effective than self-correction
A new research paper, "The Cost of Consensus," finds that homogeneous multi-agent debate among LLMs is less effective and more costly than isolated self-correction. The study, using models like Qwen2.5-7B and Llama-3.…
- New SGDe framework compiles workflows for small language models
Researchers have developed Semantic Gradient Descent (SGDe), a novel teacher-student framework designed to compile complex agentic workflows into deterministic structures for enterprise deployment of smaller language mo…