Artificial Intelligence #staror #tree search
StarOR: New AI Framework Combines Tree Search and Reinforcement Learning for Optimization Modeling
A new AI framework called StarOR combines Monte Carlo Tree Search (MCTS) with test-time reinforcement learning to solve hierarchical optimization modeling problems. It decomposes modeling into four stages and updates a LoRA (low-rank adaptation) adapter via GRPO (Group Relative Policy Optimization). With a 4B-parameter backbone, it reports state-of-the-art results on five benchmarks, outperforming existing methods and frontier LLMs.
Jun 16, 2026 1 source
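
For readers unfamiliar with GRPO, the minimal Python sketch below shows the group-relative advantage step that gives the method its name: each sampled candidate's reward is normalized against the mean and standard deviation of its own sampling group. It is a generic illustration, not StarOR's implementation. The function name, the reward values, the use of the population standard deviation, and the `eps` stabilizer are all assumptions, and the summary above does not say how rewards are defined or how the update interacts with the tree search.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against its own sampling group (GRPO-style).

    Hypothetical helper: the name, eps, and the use of the population
    standard deviation are illustrative assumptions.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    # Candidates that beat the group average get a positive advantage,
    # candidates below it get a negative one.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four candidate formulations of one problem
# (for example, 1.0 if the generated model solved correctly, else 0.0).
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Because the baseline is the group's own mean, this step needs no separate learned value function. When every candidate in a group receives the same reward, all advantages are zero and that group contributes no learning signal.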