long context

4 stories

Artificial Intelligence #training#evaluation

New Study Challenges Prior Claims on Scaling Context Length in Imitation Learning

Researchers evaluated diffusion policies for robotic imitation learning across varying context lengths, challenging prior claims that long-context scaling is fragile. They propose a training algorithm that jointly trains policies at multiple context lengths, reducing sample complexity.

Jun 17, 2026 1 source

OBCache Prunes KV Cache for Efficient Long-Context LLM Inference with Output-Aware Scoring

Technology

Artificial Intelligence #llm#ai

OBCache Prunes KV Cache for Efficient Long-Context LLM Inference with Output-Aware Scoring

A new method called Optimal Brain Cache (OBCache) treats key-value cache eviction as a layer-wise structured pruning problem. By measuring token saliency through perturbation in attention outputs, OBCache outperforms heuristic-based approaches on LLaMA and Qwen models, consistently improving long-context accuracy according to the paper.

Jun 16, 2026 1 source

Parallel Hybrid Architecture Combines GSS and Attention for Efficient Long-Context Language Modeling

Technology

Artificial Intelligence #long-context#transformer

Parallel Hybrid Architecture Combines GSS and Attention for Efficient Long-Context Language Modeling

Researchers propose the Parallel Hybrid Architecture (PHA), combining Gated State Spaces, Grouped Query Attention, and Feed-Forward Networks in parallel branches fused by a learnable mixing mechanism. On WikiText-103, PHA achieves 16.51 PPL at 125M parameters, outperforming comparable models, and scales to 180M parameters with 16.42 PPL while delivering 24% higher throughput and up to 40% lower memory usage.

Jun 16, 2026 1 source

MMLongEmbed Benchmark Reveals Limitations in Long-Context Multimodal Embedding Models

Technology

Artificial Intelligence #multimodal#embedding

MMLongEmbed Benchmark Reveals Limitations in Long-Context Multimodal Embedding Models

MMLongEmbed is the first comprehensive benchmark for evaluating multimodal embedding models (MEMs) in long-context scenarios. It comprises four retrieval tasks covering text, document, and video modalities. The evaluation reveals that current MEMs rely heavily on superficial feature matching and struggle with deep semantic and structural dependencies, with performance degrading systematically based on context length and key information placement.

Jun 16, 2026 1 source