Topic
world models
FlowMPC: New Framework Combines Flow Matching and World Models to Improve Robot Manipulation
Researchers introduce FlowMPC, a framework that pairs imitation-learned flow matching policies with a learned world model for test-time planning using MPPI. On ManiSkill manipulation tasks PickCube and PickSingleYCB, adding the world model improved performance over the flow matching policy alone, with clear gains in end-of-episode success.
Mind-Studio: Executable World Models with Lookahead Evaluation for Partially Observable Games
Researchers present Mind-Studio, a framework that uses large language models to synthesize executable world models from state-action-next-state trajectories. On Montezuma's Revenge, it improves next-state prediction from 0.3% to 48.7% and verifies 5 of 8 subgoals, outperforming prior approaches.
Medical World Models: Simulating Disease Progression to Guide Clinical Decisions
A review paper on arXiv.org introduces medical world models, adapting the world-model concept from AI to healthcare. These models aim to simulate disease evolution and support intervention decisions by learning internal simulators of patient-state dynamics. The paper outlines three core capabilities: patient-state construction, clinical dynamics modelling, and intervention decision support, and identifies challenges for clinical deployment.
New Benchmark ARB4WM Evaluates Adversarial Robustness of World Models for Safety-Critical Control
Researchers have introduced ARB4WM, a unified benchmark for evaluating adversarial robustness of world models used in continuous control systems. The framework tests attacks across policy, value, and latent-dynamics levels, revealing that targeting value estimation and latent representations can be as harmful as direct policy disruption. Early and frequent perturbations are particularly damaging, and input-level defenses offer limited recovery.