Topic
retrieval-augmented generation
RAG and LLMs Combined to Generate Personalized Reading Content at Desired Complexity
A research paper proposes a four-module system that uses Retrieval-Augmented Generation (RAG) with Large Language Models (LLMs) to generate reading content tailored to user queries and complexity preferences. Experiments with Meta LLaMA 4 Scout, LLaMA 3.1 8B Instant, and Google Gemma2 9B show that RAG improves relevance and groundedness by 26–35 percentage points across all models and prompting strategies.
CONCORD: Asynchronous Sparse Aggregation Boosts Device-Cloud RAG Efficiency Under Document Isolation
A new framework called CONCORD addresses the challenge of document isolation in device-cloud retrieval-augmented generation (RAG). By treating the cloud as an asynchronous evidence source and introducing waiting debt control and certificate-guided minimal supplementation, CONCORD improves end-to-end throughput by 1.66× to 2.15× over baselines while cutting per-token communication by over two orders of magnitude. Experiments on Natural Questions and WikiText-2 demonstrate comparable answer quality and perplexity.