Topic
scheduling
Artificial Intelligence #llm #inference
New Fluid-Guided Algorithm Optimizes LLM Inference Scheduling Under Memory Constraints
A new paper from researchers including David Simchi-Levi introduces a fluid-guided online scheduling approach for LLM inference that addresses the memory constraints imposed by Key-Value (KV) cache growth. The WAIT and Nested WAIT algorithms approximate an optimal fluid benchmark and, according to simulations on Llama-2-7B with A100 GPUs, reduce latency in overloaded regimes.
Jun 16, 2026 1 source
Software #mergesort #human-in-the-loop
Surprise-Guided MergeSort Reduces Human Ranking Costs by Using AI to Prioritise Comparisons
Researchers propose Surprise-Guided MergeSort (SGS), a framework that combines a Vision-Language Model with MergeSort to schedule pairwise comparisons for subjective ranking tasks. SGS routes only ambiguous comparisons to humans and improves Kendall's τ×100 by 6 to 12 points over Active Elo under the same budget.
Jun 16, 2026 1 source
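The routing idea in the SGS summary can be illustrated with a minimal sketch. It is not the paper's method: the summary does not say how "surprise" is computed, so a simple confidence margin stands in for it. The names `model_prob`, `ask_human`, and `margin` are hypothetical, and the toy model below replaces a real Vision-Language Model.

```python
from typing import Callable, List, Tuple


def merge_sort_with_routing(
    items: List[int],
    model_prob: Callable[[int, int], float],  # hypothetical: P(a ranks above b)
    ask_human: Callable[[int, int], bool],    # hypothetical: True if a ranks above b
    margin: float = 0.2,                      # assumed ambiguity threshold
) -> Tuple[List[int], int]:
    """Rank items best-first, sending only ambiguous pairs to a human."""
    human_calls = 0

    def a_before_b(a: int, b: int) -> bool:
        nonlocal human_calls
        p = model_prob(a, b)
        if abs(p - 0.5) < margin:
            # The model is unsure about this pair, so spend human budget on it.
            human_calls += 1
            return ask_human(a, b)
        return p > 0.5

    def sort(seq: List[int]) -> List[int]:
        if len(seq) <= 1:
            return list(seq)
        mid = len(seq) // 2
        left, right = sort(seq[:mid]), sort(seq[mid:])
        merged: List[int] = []
        i = j = 0
        # Standard merge step, with the comparison delegated to a_before_b.
        while i < len(left) and j < len(right):
            if a_before_b(left[i], right[j]):
                merged.append(left[i])
                i += 1
            else:
                merged.append(right[j])
                j += 1
        merged.extend(left[i:])
        merged.extend(right[j:])
        return merged

    return sort(items), human_calls


# Toy stand-ins: the "model" is confident only when scores differ by 2 or more.
def toy_model(a: int, b: int) -> float:
    if a - b >= 2:
        return 0.9
    if b - a >= 2:
        return 0.1
    return 0.5


def toy_human(a: int, b: int) -> bool:
    return a > b


ranking, human_calls = merge_sort_with_routing([3, 1, 4, 2], toy_model, toy_human)
print(ranking, human_calls)
```

In this toy run, the model settles the two easy pairs and only the close calls during the final merge reach the human. Lowering `margin` sends fewer pairs to the human at the cost of trusting the model on closer comparisons.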