
FormalRewardBench benchmark evaluates LLM reward models for theorem proving

Researchers have introduced FormalRewardBench, a new benchmark designed to evaluate reward models used in formal theorem proving. It addresses the challenge of sparse credit assignment in reinforcement learning for theorem provers by enabling reward models to be compared without extensive retraining. FormalRewardBench includes 250 preference pairs built with various error-injection strategies and has been used to test several large language models, revealing that frontier models perform best at evaluating proof quality.
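The summary describes the benchmark only at a high level; the sketch below illustrates one plausible way a preference-pair benchmark of this kind could score a reward model, via pairwise accuracy over correct/corrupted proof pairs. All class, field, and function names here are illustrative assumptions, not FormalRewardBench's actual data format or API.

```python
# Hypothetical sketch of pairwise-accuracy scoring over preference pairs,
# in the spirit of FormalRewardBench; names and data layout are assumptions,
# not the benchmark's actual schema.
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class PreferencePair:
    theorem: str     # theorem statement the proofs target
    chosen: str      # correct (verifier-accepted) proof
    rejected: str    # proof corrupted by an error-injection strategy
    error_type: str  # e.g. "tactic_swap", "hypothesis_drop" (illustrative labels)


def pairwise_accuracy(
    reward_model: Callable[[str, str], float],
    pairs: List[PreferencePair],
) -> float:
    """Fraction of pairs where the reward model prefers the correct proof."""
    correct = sum(
        reward_model(p.theorem, p.chosen) > reward_model(p.theorem, p.rejected)
        for p in pairs
    )
    return correct / len(pairs)
```

Under this framing, any candidate reward model plugs in as the scoring function, and the benchmark reduces to reporting pairwise accuracy, which can also be broken out per error-injection strategy.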

Summary written by gemini-2.5-flash-lite from 1 source.

IMPACT This benchmark aims to improve reward models for AI theorem provers, potentially leading to more capable AI systems in formal mathematics and complex reasoning tasks.

RANK_REASON The cluster describes a new academic paper introducing a benchmark for evaluating AI models in a specific domain.

Read on arXiv cs.AI →

COVERAGE [1]

  1. arXiv cs.AI TIER_1 · Gözde Gül Şahin

    FormalRewardBench: A Benchmark for Formal Theorem Proving Reward Models

    Recent neural theorem provers use reinforcement learning with verifiable rewards (RLVR), where proof assistants provide binary correctness signals. While verifiable rewards are cheap and scalable without reward hacking issues, they suffer from sparse credit assignment: models rec…
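For context on the RLVR setup the abstract refers to, the following is a minimal, hypothetical sketch of a binary verifiable reward: the proof assistant either accepts or rejects the whole proof, yielding a single sparse scalar per attempt. The checker command and file handling are placeholders, not a real proof-assistant CLI.

```python
# Minimal sketch of a binary verifiable reward as used in RLVR; the checker
# command is a placeholder, not a real CLI. Note the sparse credit assignment:
# every step of a failed proof receives the same zero reward.
import subprocess
import tempfile


def verifiable_reward(proof_source: str, checker_cmd: str = "proof-checker") -> float:
    """Return 1.0 if the (hypothetical) proof checker accepts the proof, else 0.0."""
    with tempfile.NamedTemporaryFile("w", suffix=".lean", delete=False) as f:
        f.write(proof_source)
        path = f.name
    result = subprocess.run([checker_cmd, path], capture_output=True)
    return 1.0 if result.returncode == 0 else 0.0
```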