Artificial Intelligence #spatial reasoning #multimodal agents
SpatialWorld Benchmark Reveals Multimodal Agents Struggle with Interactive Spatial Reasoning
Researchers introduced SpatialWorld, a benchmark for evaluating the interactive spatial understanding of multimodal agents in real-world tasks. Of the 15 advanced agents tested, the strongest model (GPT-5) achieved a task success rate of only 17.4%, highlighting challenges in active exploration and long-horizon planning.
Jun 16, 2026 1 source