
ROSA-RL Uses Reinforcement Learning to Navigate Roundabouts with Uncertainty Awareness

ROSA-RL is an uncertainty-aware speed advisory system for roundabouts that combines reinforcement learning with a Transformer-based model that predicts conflict zone occupancy. In simulation, it outperforms a comparable model-based baseline and nearly matches an ideal setting that assumes fully known occupancy.

iGEN Editorial
June 16, 2026

Roundabouts present a major challenge for automated driving because human behavior is heterogeneous and non-deterministic, driving intentions are unknown, and interaction complexity is high. These factors create uncertainty about whether the conflict zone will be blocked or available at the moment of entry. According to a paper published on arXiv, researchers have developed ROSA-RL (Roundabout Optimized Speed Advisory with Reinforcement Learning) to address this problem.

Probabilistic Conflict Forecasting with Transformers

ROSA-RL employs a Transformer-based model to predict conflict zone occupancy over a five-second horizon. The model captures multi-agent interactions, enabling it to anticipate upcoming conflicts and available gaps. According to the paper, the prediction outputs encode uncertainty in future motion and intent, and they are used to augment the state of a classical reinforcement learning (RL) framework. This allows the system to coordinate speed in an uncertainty-aware manner.
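To make the idea of state augmentation concrete, here is a minimal sketch of how per-step occupancy probabilities might be appended to an ego-vehicle state before it reaches an RL policy. It is not the authors' implementation: the `EgoState` fields, the 0.5-second step size, and the function name `augment_state` are all assumptions chosen for illustration. Only the five-second horizon comes from the article.

```python
from dataclasses import dataclass
from typing import List

HORIZON_S = 5.0  # five-second prediction horizon, as reported in the article
STEP_S = 0.5     # ASSUMPTION: discretization of the horizon, not from the source


@dataclass
class EgoState:
    """Hypothetical ego-vehicle features; the paper's actual state is not given here."""
    distance_to_entry_m: float
    speed_mps: float


def augment_state(ego: EgoState, occupancy_probs: List[float]) -> List[float]:
    """Concatenate ego features with predicted occupancy probabilities.

    occupancy_probs[k] is taken to be the predicted probability that the
    conflict zone is occupied during step k of the horizon.
    """
    expected_steps = int(HORIZON_S / STEP_S)
    if len(occupancy_probs) != expected_steps:
        raise ValueError(f"expected {expected_steps} probabilities")
    if any(p < 0.0 or p > 1.0 for p in occupancy_probs):
        raise ValueError("probabilities must lie in [0, 1]")
    # The policy sees the raw probabilities, so it can distinguish a confident
    # forecast (near 0 or 1) from an uncertain one (near 0.5).
    return [ego.distance_to_entry_m, ego.speed_mps, *occupancy_probs]


if __name__ == "__main__":
    forecast = [0.9, 0.8, 0.6, 0.4, 0.2, 0.1, 0.1, 0.3, 0.5, 0.7]
    state = augment_state(EgoState(30.0, 8.0), forecast)
    print(len(state))
```

Passing probabilities rather than a thresholded blocked/free flag is the design point of the sketch: a hard flag would discard the uncertainty information that the article says the policy is meant to use.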

Uncertainty-Aware Reinforcement Learning

The core innovation of ROSA-RL is its explicit handling of uncertainty. By incorporating probabilistic conflict forecasts into the RL state representation, the system can make safer and more efficient decisions in mixed traffic, where human-driven and automated vehicles interact. The researchers note that this approach nearly closes the gap to an ideal setting that assumes fully known occupancy, improving both traffic efficiency and safety.
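As a rough illustration of why a probabilistic forecast is useful for a speed advisory, the sketch below looks up the predicted occupancy probability at the moment a vehicle would reach the roundabout entry at a given speed, and scores how uncertain that forecast is using binary entropy. This is a hand-written toy, not the learned policy described in the paper. The function names, the 0.5-second step size, and the constant-speed arrival estimate are all assumptions.

```python
import math
from typing import List, Optional


def occupancy_at_arrival(
    distance_m: float,
    speed_mps: float,
    occupancy_probs: List[float],
    step_s: float = 0.5,  # ASSUMPTION: forecast step size, not from the source
) -> Optional[float]:
    """Return the forecast occupancy probability at the estimated arrival time.

    Assumes constant speed, so arrival time is distance / speed. Step k is
    taken to cover the interval [k * step_s, (k + 1) * step_s). Returns None
    if the vehicle is stopped or would arrive beyond the forecast horizon.
    """
    if speed_mps <= 0.0:
        return None
    arrival_s = distance_m / speed_mps
    index = int(arrival_s / step_s)
    if index >= len(occupancy_probs):
        return None
    return occupancy_probs[index]


def binary_entropy(p: float) -> float:
    """Uncertainty of a blocked/free forecast in bits: 0 when certain, 1 at p = 0.5."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


if __name__ == "__main__":
    forecast = [0.9, 0.8, 0.6, 0.4, 0.2, 0.1, 0.1, 0.3, 0.5, 0.7]
    for speed in (10.0, 8.0, 5.0):
        p = occupancy_at_arrival(20.0, speed, forecast)
        if p is not None:
            print(speed, p, round(binary_entropy(p), 3))
```

In this toy, a different approach speed shifts the arrival time and therefore the occupancy probability the vehicle faces at entry. An RL policy that receives the full forecast in its state could learn this trade-off from experience instead of relying on a fixed rule like the one above.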

Simulation Evaluation and Results

ROSA-RL was evaluated in simulations grounded in real-world data. According to the paper, the system effectively handles uncertainty and outperforms a comparable model-based baseline. The results demonstrate that the uncertainty-aware RL framework can nearly match the performance of an ideal system with complete knowledge of future occupancy, without requiring that perfect information.

The source code for ROSA-RL is publicly available, as noted in the paper.

Implications for Mixed Traffic Automation

While ROSA-RL is specifically designed for roundabouts, the underlying approach of combining Transformer-based multi-agent prediction with uncertainty-aware reinforcement learning could be extended to other traffic scenarios involving high interaction complexity and uncertain human behavior. The paper, authored by Anna-Lena Schlamp, Jeremias Gerner, Klaus Bogenberger, Werner Huber, and Stefanie Schmidtner, is listed under Computer Science > Artificial Intelligence on arXiv.

