ENTITY
ClassEval-Pro
ClassEval-Pro
PulseAugur coverage of ClassEval-Pro — every cluster mentioning ClassEval-Pro across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
ClassEval-Pro benchmark reveals LLMs struggle with class-level code generation
Researchers have introduced ClassEval-Pro, a new benchmark designed to evaluate the class-level code generation capabilities of large language models. This benchmark consists of 300 tasks across 11 domains, created usin…
-
LLM research explores new methods for training, evaluation, and understanding model behavior
Researchers are developing new methods to improve LLM capabilities in various domains. One study introduces MemCoE, a cognition-inspired framework for LLM agents to learn how to organize and update long-term user memory…