Topic
compression
Artificial Intelligence #lossy compression#neural networks
Lossy Compression Slashes Storage 39x for Neural Surrogate Models, Study Finds
A new study quantifies the impact of lossy compression on neural generative surrogate models, finding that storage can be reduced by up to 39x and training time by up to 3x with negligible effect on model quality, offering a path to more efficient AI training in data-intensive domains.
Jun 16, 2026 1 source
Artificial Intelligence #kv cache#compression
PolyKV: Layer-Wise KV Cache Compression Boosts LLM Inference Efficiency by Up to 54.5%
PolyKV is a new framework for compressing the key-value cache in large language model inference. It selects a compression policy per transformer layer and allocates non-uniform cache budgets, outperforming uniform approaches. On LongBench tasks, PolyKV recovers 40%-54.5% of the performance gap between the strongest single-policy baseline and full KV cache.
Jun 16, 2026 1 source