Artificial Intelligence #llm#machine learning
Mask-Proof: New LLM Pipeline Automates Data Curation for Mathematical Proofs with 96.8% Accuracy
Researchers introduce Mask-Proof, an LLM-based pipeline that turns real mathematical proofs into automatically checkable masked-step tasks. The resulting Mask-ProofBench contains 292 problems. Seventeen models tested show reasoning-enhanced models outperform standard ones by 12-27%, with the evaluator achieving 96.8% agreement with expert annotators.
Jun 16, 2026 1 source