Artificial Intelligence #robotics#ai
ViTaL Framework Combines Vision and Touch to Boost Robot Manipulation Success by 51%
ViTaL, a visuo-tactile inference-time steering framework, uses a bi-level optimization combining visual sampling and tactile diffusion to guide robot policies. On three real-world contact-rich manipulation tasks, it improved success by 51% over the base policy, outperformed unimodal steering by at least 33%, and exceeded naive multimodal fusion by at least 20%.
Jun 16, 2026 2 sources