miniF2F

PulseAugur coverage of miniF2F — every cluster mentioning miniF2F across labs, papers, and developer communities, ranked by signal.

Total · 30d: 4 (4 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 4 (4 over 90d)
Sentiment · 30d: 2 days with sentiment data

RECENT · PAGE 1/1 · 4 TOTAL
  1. TOOL · CL_74387

    LLMs evaluated for formal math proofs in Lean 4

    A new research paper evaluates the performance of various Large Language Models (LLMs) in generating formal mathematical proofs using the Lean 4 theorem prover. The study employed pass@k and refine@k metrics on subsets …

  2. TOOL · CL_70440

    LLM autoformalization struggles with paraphrased inputs

    Researchers have investigated the robustness of large language models (LLMs) in autoformalization tasks, specifically their ability to generate formal proofs from natural language statements. The study found that LLMs e…

  3. TOOL · CL_22214

    New AI method achieves 100% formal validity in theorem autoformalization

    Researchers have developed a novel reference-free iterative refinement process for autoformalizing entire mathematical theorems. This method utilizes feedback from theorem provers and LLM-based judges to enhance formal …

  4. RESEARCH · CL_06763

    Lean 4 autoformalization sensitive to surface phrasing, not semantics

    Researchers have investigated the impact of natural language variations on Lean 4 autoformalization, finding that semantically equivalent paraphrases can lead to different formal outputs. Their study, using GPT-family m…