world models

7 stories

Reward as an Agent: A New Framework for Robust Exploration in Embodied World Models

A new reinforcement learning framework introduces Reward as an Agent to provide robust verification and DynDiff-GRPO for diversified exploration. The method mitigates reward hacking and achieves significant accuracy gains across multiple open-source world models, demonstrating that broader exploration can scale with reliable verification.

Jun 20, 2026 1 source

New Unified Framework for World Models Aims to Bridge Human and Machine Cognition

A new research paper presents a unified conceptual framework for world models that integrates cognitive functions such as memory, perception, language, reasoning, imagination, motivation, and metacognition. The authors find that motivation and metacognition remain under-researched and propose research directions grounded in active inference and global workspace theory. They also introduce epistemic world models for scientific discovery.

Jun 17, 2026 1 source

EV-WM: Event-Verified World Models Boost Long-Horizon Robotic Manipulation for Industrial Automation

A research paper introduces EV-WM, a predicate-grounded verification framework for world-model planning in robotic manipulation. By decoding candidate futures into structured event states and scoring them on task progress, semantic consistency, physical feasibility, and uncertainty, EV-WM makes long-horizon planning more interpretable and better aligned with task goals. The approach shows promising results in navigation, deformable-object handling, and contact-sensitive tasks, suggesting potential for supply chain and logistics automation.

Jun 16, 2026 1 source

FlowMPC: New Framework Combines Flow Matching and World Models to Improve Robot Manipulation

Researchers introduce FlowMPC, a framework that pairs imitation-learned flow matching policies with a learned world model for test-time planning using MPPI. On ManiSkill manipulation tasks PickCube and PickSingleYCB, adding the world model improved performance over the flow matching policy alone, with clear gains in end-of-episode success.

Jun 16, 2026 1 source
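
For readers unfamiliar with MPPI, the planner named in the FlowMPC summary, the following is a minimal sketch of the general technique on a toy one-dimensional problem. It is not the FlowMPC implementation. The names `world_model`, `policy_prior`, and `cost` are hypothetical stand-ins for a learned dynamics model, an imitation-learned policy, and a task cost. Seeding the plan from the policy is an assumption made for illustration, not something the summary specifies.

```python
import math
import random


def world_model(state, action):
    """Hypothetical stand-in for a learned dynamics model: a 1-D point that moves by the action."""
    return state + action


def cost(state, goal):
    """Hypothetical task cost: squared distance to the goal."""
    return (state - goal) ** 2


def policy_prior(state, goal):
    """Hypothetical stand-in for an imitation-learned policy: a small, clipped step toward the goal."""
    return max(-0.1, min(0.1, goal - state))


def mppi_action(state, goal, horizon=5, samples=256, sigma=0.3, temperature=1.0, rng=None):
    """Return the first action of an MPPI-style cost-weighted average over sampled plans."""
    rng = rng or random.Random(0)

    # Nominal plan: roll the prior policy forward through the world model.
    nominal, s = [], state
    for _ in range(horizon):
        a = policy_prior(s, goal)
        nominal.append(a)
        s = world_model(s, a)

    # Sample Gaussian perturbations of the nominal plan and score each rollout.
    plans, costs = [], []
    for k in range(samples):
        # Sample 0 keeps the unperturbed nominal plan.
        plan = [a + (rng.gauss(0.0, sigma) if k else 0.0) for a in nominal]
        s, total = state, 0.0
        for a in plan:
            s = world_model(s, a)
            total += cost(s, goal)
        plans.append(plan)
        costs.append(total)

    # Exponentiated-cost weights; subtracting the best cost keeps exp() well scaled.
    best = min(costs)
    weights = [math.exp(-(c - best) / temperature) for c in costs]
    norm = sum(weights)
    return sum(w * p[0] for w, p in zip(weights, plans)) / norm


action = mppi_action(state=0.0, goal=1.0)
```

In this toy setting the prior takes deliberately small steps, so planning through the model tends to pick a larger step toward the goal. In a receding-horizon loop, only the returned first action is executed before replanning from the next state.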

Mind-Studio: Executable World Models with Lookahead Evaluation for Partially Observable Games

Researchers present Mind-Studio, a framework that uses large language models to synthesize executable world models from state-action-next-state trajectories. On Montezuma's Revenge, it improves next-state prediction from 0.3% to 48.7% and verifies 5 of 8 subgoals, outperforming prior approaches.

Jun 16, 2026 2 sources

Medical World Models: Simulating Disease Progression to Guide Clinical Decisions

A review paper on arXiv introduces medical world models, adapting the world-model concept from AI to healthcare. These models aim to simulate disease evolution and support intervention decisions by learning internal simulators of patient-state dynamics. The paper outlines three core capabilities (patient-state construction, clinical dynamics modelling, and intervention decision support) and identifies challenges for clinical deployment.

Jun 16, 2026 1 source

New Benchmark ARB4WM Evaluates Adversarial Robustness of World Models for Safety-Critical Control

Researchers have introduced ARB4WM, a unified benchmark for evaluating adversarial robustness of world models used in continuous control systems. The framework tests attacks across policy, value, and latent-dynamics levels, revealing that targeting value estimation and latent representations can be as harmful as direct policy disruption. Early and frequent perturbations are particularly damaging, and input-level defenses offer limited recovery.

Jun 16, 2026 1 source