Artificial Intelligence #ai#adversarial robustness
New Benchmark ARB4WM Evaluates Adversarial Robustness of World Models for Safety-Critical Control
Researchers have introduced ARB4WM, a unified benchmark for evaluating adversarial robustness of world models used in continuous control systems. The framework tests attacks across policy, value, and latent-dynamics levels, revealing that targeting value estimation and latent representations can be as harmful as direct policy disruption. Early and frequent perturbations are particularly damaging, and input-level defenses offer limited recovery.
Jun 16, 2026 1 source