Topic
adversarial robustness
GRAPE: New Training Method Boosts Adversarial Robustness with 21% Fewer Parameters
A new training framework called GRAPE (Guided Parameter-Space Evolution) improves adversarial robustness in neural networks by progressively exposing parameters, achieving 56.94% robust accuracy on CIFAR-10 with 21.4% fewer parameters than standard adversarial training, according to an arXiv paper.
New Benchmark ARB4WM Evaluates Adversarial Robustness of World Models for Safety-Critical Control
Researchers have introduced ARB4WM, a unified benchmark for evaluating adversarial robustness of world models used in continuous control systems. The framework tests attacks across policy, value, and latent-dynamics levels, revealing that targeting value estimation and latent representations can be as harmful as direct policy disruption. Early and frequent perturbations are particularly damaging, and input-level defenses offer limited recovery.