Visit IGEN World Explore IGEN Expo

EXPLORE UPGRADE PLANS

BREAKING

AI-Powered Microphone Monitors Elderly Father for Falls, Raising Privacy Questions New UDS Framework Slashes LLM Fine-Tuning Time While Boosting Model Performance Cascaded Sparse Autoencoders Enable Hierarchical Visual Concept Learning in Multimodal LLMs Multiple Factors Set to Reset Ocean Rates in Coming Weeks Orcheo: An Open-Source Modular Full-Stack Platform for Conversational Search First Model-Free Universal AI Agent Proved Asymptotically Optimal in General Reinforcement Learning AuAu Benchmark Audits Authoritarian Alignment in Large Language Models from Four Regions VinQA Dataset Enables Multimodal Document QA with Interleaved Visual Elements for Enterprise AI AlignCoder Uses Reinforcement Learning to Improve Repository-Level Code Completion by 18% New Fluid-Guided Algorithm Optimizes LLM Inference Scheduling Under Memory Constraints AI-Powered Microphone Monitors Elderly Father for Falls, Raising Privacy Questions New UDS Framework Slashes LLM Fine-Tuning Time While Boosting Model Performance Cascaded Sparse Autoencoders Enable Hierarchical Visual Concept Learning in Multimodal LLMs Multiple Factors Set to Reset Ocean Rates in Coming Weeks Orcheo: An Open-Source Modular Full-Stack Platform for Conversational Search First Model-Free Universal AI Agent Proved Asymptotically Optimal in General Reinforcement Learning AuAu Benchmark Audits Authoritarian Alignment in Large Language Models from Four Regions VinQA Dataset Enables Multimodal Document QA with Interleaved Visual Elements for Enterprise AI AlignCoder Uses Reinforcement Learning to Improve Repository-Level Code Completion by 18% New Fluid-Guided Algorithm Optimizes LLM Inference Scheduling Under Memory Constraints

Home ›› Topics ›› manipulation policies

Topic

manipulation policies

1 story

ATOM-Bench: New Benchmark Evaluates Atomic Skills and Compositional Generalization in Robotic Manipulation Policies

Artificial Intelligence #benchmark#atomic skills

ATOM-Bench: New Benchmark Evaluates Atomic Skills and Compositional Generalization in Robotic Manipulation Policies

Researchers introduce ATOM-Bench, a real-world benchmark that factorizes tabletop manipulation into atomic skills and compositional tasks. It includes 30 atomic tasks and 24 held-out compositional tasks across single-arm and dual-arm tracks, with 3,000 human demonstrations. Through 2,700 physical rollouts, the team found that current policies struggle with fine-grained motor skills, counting, and logical filtering, and strong atomic performance does not guarantee compositional transfer.

Jun 16, 2026 1 source