Topic
code
Semi-Supervised Framework Scales LLM Reasoning Using 10-15x Fewer Labels Than Traditional Methods
A new semi-supervised framework for training LLM reasoning uses a lightweight verifier to judge reasoning quality, requiring only a few labeled samples. Experiments on math problems and visual question answering show accuracy comparable to 10-15x more labeled data. The method could reduce the cost of building large-scale reasoning datasets.
New Agentic Programming Framework Shifts Control from LLMs to Deterministic Code for Greater Reliability
A new paper argues that current LLM agent frameworks have architectural flaws leading to token explosion, control-flow hallucination, and unreliable completion. The authors propose Agentic Programming, where the program governs all control flow and the LLM is an adaptive component called LLM-as-Code, invoked only for reasoning or generation. A case study on computer-use agents shows improved stability in long visual operation sequences.