
ChatPlanner: LLM Framework Personalizes Public Transit Routing with Fine-Tuning and RAG

Researchers present ChatPlanner, a framework that combines fine-tuned Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) to capture diverse user preferences for public transit routing. The system extracts routing parameters from natural language queries, integrates preferences into the routing algorithm, and generates feasible, personalized alternatives. Three experiments show that the combined fine-tuning and RAG approach achieves the highest accuracy and uncovers valuable solutions overlooked by existing route planners.

iGEN Editorial
June 16, 2026
Personalizing public transit routing has long been a challenge because traditional routing algorithms struggle to incorporate the diverse preferences of individual travelers. A new framework called ChatPlanner, detailed in a paper posted on arXiv, addresses this by leveraging Large Language Models (LLMs) to interpret natural language queries and translate user preferences into routing objectives.

The research, conducted by Yang, Tingting, Xue, Chenhao, and Jun, proposes a framework that combines fine-tuned LLMs with Retrieval-Augmented Generation (RAG). According to the paper, ChatPlanner extracts routing parameters and interprets nuanced user preferences from natural language inputs, then integrates these preferences into the objective function of a public transit routing algorithm. The system was trained and evaluated on preference-aware datasets that incorporate eight personas and five contexts to establish scoring standards for both fine-tuning and RAG.

How ChatPlanner Works

ChatPlanner’s architecture consists of two core components: a fine-tuned LLM that learns general preference patterns and enforces the required output structure, and a RAG module that provides query-specific context. According to the paper, RAG helps resolve imprecise or conversational expressions and calibrates continuous scores, while fine-tuning ensures the model adheres to the expected format. Together, they achieve the highest accuracy in both routing information extraction and user preference interpretation.
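The division of labor between the two components can be illustrated with a minimal Python sketch. It is not the paper's implementation: the example store, the token-overlap retrieval, the JSON field names (`origin`, `destination`, `walking`, `transfers`), and the 0-to-1 score range are all assumptions made for illustration. The retrieval step stands in for RAG supplying query-specific examples, and the validation step stands in for the output structure that fine-tuning is meant to enforce.

```python
import json

# Hypothetical retrieval store: past queries paired with preference scores.
# The queries, keys, and 0-1 score range are illustrative assumptions.
EXAMPLES = [
    ("I hate walking far with my luggage", {"walking": 0.9, "transfers": 0.4}),
    ("just get me there with no changes please", {"walking": 0.3, "transfers": 0.9}),
    ("quickest way to the airport", {"walking": 0.2, "transfers": 0.2}),
]


def tokens(text):
    """Lowercase whitespace tokenization; a stand-in for a real embedding."""
    return set(text.lower().split())


def retrieve(query, k=1):
    """Return the k stored examples with the highest Jaccard token overlap."""
    q = tokens(query)

    def overlap(item):
        t = tokens(item[0])
        return len(q & t) / len(q | t)

    return sorted(EXAMPLES, key=overlap, reverse=True)[:k]


def build_prompt(query, k=1):
    """Prepend retrieved examples so the model can calibrate its scores."""
    shots = "\n".join(
        f"Query: {text}\nScores: {json.dumps(scores)}"
        for text, scores in retrieve(query, k)
    )
    return f"{shots}\nQuery: {query}\nScores:"


# Hypothetical output schema the fine-tuned model is expected to follow.
REQUIRED_KEYS = {"origin", "destination", "walking", "transfers"}


def parse_output(raw):
    """Validate a model response: JSON object, required keys, scores in [0, 1]."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    missing = REQUIRED_KEYS - set(data)
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")
    for key in ("walking", "transfers"):
        if not 0.0 <= float(data[key]) <= 1.0:
            raise ValueError(f"{key} score out of range")
    return data
```

In this sketch, a query such as "I do not want walking with luggage" retrieves the luggage example, and a response missing a required field is rejected before it reaches the routing step. A production system would replace the token overlap with embedding-based retrieval and the hand-written checks with a schema validator.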

The framework is designed to tackle the difficulty of capturing and integrating diverse user preferences into routing algorithms, a persistent challenge in public transit systems.

Experimental Validation

Three experiments were conducted to validate ChatPlanner’s capabilities. The first tested the feasibility of the solutions generated. The second focused on the extraction of routing information and preferences. The third evaluated the quality and completeness of the solution set. Results demonstrated that ChatPlanner reliably generates feasible solutions, according to the paper.

The table below summarizes the experiments and their outcomes:

| Experiment | Purpose | Outcome |
|---|---|---|
| Feasibility | Validate that generated solutions are practical | ChatPlanner reliably produces feasible routes |
| Information extraction | Assess accuracy in extracting routing parameters and preferences | Combined fine-tuning + RAG achieves highest accuracy |
| Solution set quality | Evaluate diversity and value of alternatives | Captures user preferences overlooked by existing planners |

Implications for Transit Optimization

By capturing user preferences, ChatPlanner identifies route alternatives along dimensions that existing route planners overlook. For example, a traveler who prioritizes minimal walking distance might receive a different route than one who values fewer transfers, a distinction conventional planners often fail to offer.
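The walking-versus-transfers example can be made concrete with a small Python sketch. It assumes a weighted-sum objective, which is one common way to fold preferences into a routing cost. The article does not specify the form of ChatPlanner's objective function, so the `Route` fields, penalty names, candidate routes, and all numbers below are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    name: str
    travel_min: float  # in-vehicle plus waiting time, minutes
    walk_m: float      # total walking distance, metres
    transfers: int     # number of vehicle changes


def cost(route, prefs):
    """Generalized cost in minutes: travel time plus preference penalties.

    The weighted-sum form is an assumption for illustration only.
    """
    return (
        route.travel_min
        + prefs["walk_penalty_per_100m"] * route.walk_m / 100
        + prefs["transfer_penalty"] * route.transfers
    )


def best(routes, prefs):
    """Pick the candidate with the lowest generalized cost."""
    return min(routes, key=lambda r: cost(r, prefs))


# Two hypothetical candidates between the same origin and destination.
CANDIDATES = [
    Route("direct bus", travel_min=42, walk_m=900, transfers=0),
    Route("metro + tram", travel_min=38, walk_m=200, transfers=2),
]

# Hypothetical weights, as might be derived from the extracted preferences.
walk_averse = {"walk_penalty_per_100m": 2.0, "transfer_penalty": 1.0}
transfer_averse = {"walk_penalty_per_100m": 0.2, "transfer_penalty": 8.0}

print(best(CANDIDATES, walk_averse).name)      # metro + tram
print(best(CANDIDATES, transfer_averse).name)  # direct bus
```

With the walk-averse weights the costs are 60 for the direct bus and 44 for metro plus tram, so the second route wins. With the transfer-averse weights they are 43.8 and 54.4, so the direct bus wins. The same candidate set yields different recommendations once the weights change.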

The paper states that the integration of natural language understanding into transportation optimization establishes a new paradigm. For technology leaders in logistics and supply chain, the underlying principle is directly transferable to problems like fleet routing, last-mile delivery, and intermodal transport planning: combine fine-tuned LLMs with RAG to extract preferences, then integrate those preferences into the optimization algorithm. The framework demonstrates how LLMs can bridge the gap between human preferences and algorithmic optimization, potentially reducing planning time and increasing user satisfaction.

While the current study focuses on public transit, the approach could be adapted to any routing domain where user preferences are diverse and dynamic. The authors note that future work could expand to multi-modal transport or real-time dynamic preferences. As enterprises seek to personalize digital experiences, ChatPlanner offers a blueprint for combining generative AI with operational research.

