ENTITY
Elo
Elo
PulseAugur coverage of Elo — every cluster mentioning Elo across labs, papers, and developer communities, ranked by signal.
Total · 30d
536
536 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
38
38 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
Study finds global LLM leaderboards misleading, proposes portfolio rankings
A new research paper argues that current leaderboards for large language models (LLMs) are misleading due to significant heterogeneity in user preferences across languages and tasks. The study analyzed approximately 89,…
-
Chess-GPT model learns world model, can be manipulated to change skill
Researchers have explored interventions on a language model trained to play chess, dubbed Chess-GPT. By manipulating the model's internal representations of the board state and player skill, they demonstrated a causal l…