Artificial Intelligence #ai#deep research
Hybrid Open-Ended Tri-Evolution Framework Boosts Deep Research AI Performance
Researchers propose the Hybrid Open-Ended Tri-Evolution (HOTE) framework that uses hybrid-mode reinforcement learning to collaboratively evolve a proposer, solver, and judge for deep research tasks. An 8B model trained with HOTE surpasses static open 8-32B models and state-of-the-art deep research training methods while requiring less time overhead.
Jun 17, 2026 1 source