iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
PATCH Monitor Enables Robots to Handle Unexpected Disturbances During Manipulation Tasks Z-Plane Neural Networks Replace ReLU and LayerNorm with Bounded Geometric Activation APEC Climate Center Upgrades El Niño to Strong; Indian Monsoon Faces Elevated Risk New Architecture GRIL Enables Gradient Descent-Like Learning in Linear Recurrent Networks ToolSelf AI Agents Achieve 28.8 Point Gain Through Runtime Self-Reconfiguration ArtNet: JEPA-Like Articulatory Framework Achieves 20.56% Error Reduction in Zero-Shot Phoneme Recognition LLM-Assisted Stance Detection in Scientific Discourse Reaches 0.76 Combined Reliability Score New Drift-RAE Method Distills Transformers Efficiently Using Representation Autoencoders Cough Regression Benchmark Reveals Trade-Offs in Respiratory Acoustic Foundation Models Spacex Acquires AI Coding Startup Cursor For $60bn Days After Bumper IPO PATCH Monitor Enables Robots to Handle Unexpected Disturbances During Manipulation Tasks Z-Plane Neural Networks Replace ReLU and LayerNorm with Bounded Geometric Activation APEC Climate Center Upgrades El Niño to Strong; Indian Monsoon Faces Elevated Risk New Architecture GRIL Enables Gradient Descent-Like Learning in Linear Recurrent Networks ToolSelf AI Agents Achieve 28.8 Point Gain Through Runtime Self-Reconfiguration ArtNet: JEPA-Like Articulatory Framework Achieves 20.56% Error Reduction in Zero-Shot Phoneme Recognition LLM-Assisted Stance Detection in Scientific Discourse Reaches 0.76 Combined Reliability Score New Drift-RAE Method Distills Transformers Efficiently Using Representation Autoencoders Cough Regression Benchmark Reveals Trade-Offs in Respiratory Acoustic Foundation Models Spacex Acquires AI Coding Startup Cursor For $60bn Days After Bumper IPO
Home ›› Technology ›› Ai ›› Lossy Compression Slashes Storage 39x for Neural Surrogate Models, Study Finds

Lossy Compression Slashes Storage 39x for Neural Surrogate Models, Study Finds

A new study quantifies the impact of lossy compression on neural generative surrogate models, finding that storage can be reduced by up to 39x and training time by up to 3x with negligible effect on model quality, offering a path to more efficient AI training in data-intensive domains.

iG
iGEN Editorial
June 16, 2026
Lossy Compression Slashes Storage 39x for Neural Surrogate Models, Study Finds

Neural networks are increasingly used as generative surrogate models to replace time-consuming numerical simulations, but the massive training datasets required create significant storage and I/O bottlenecks. A new study from researchers including Zhimin, Menon, Harshitha, Jekel, Charles, Pascucci, Valerio, and Lindstrom, published on arXiv, examines how lossy compression of training data impacts the quality of these surrogate models. The findings show that compression can reduce storage requirements by up to 23.7x and 39x across two application simulations, while also speeding up training by up to 3x—all with negligible impact on model quality.

The Storage Challenge in Generative Surrogate Modeling

High-fidelity generative surrogate models demand large training datasets, which can create storage and I/O challenges, according to the paper. Lossy compression is a promising way to reduce this burden, but compression errors may affect model quality in subtle ways, making it difficult to quantify their impact. The researchers set out to characterize this uncertainty and develop a method to estimate how much compression-induced error a surrogate model can tolerate without degrading accuracy.

Methodology: Characterizing Inherent Uncertainty

The team began by characterizing the uncertainty inherent in training neural networks, showing that identical training configurations can produce different models. By exploiting this variability, they proposed a method to estimate the tolerance of a surrogate model to compression errors. The approach was evaluated on two application simulations, though the specific applications are not named in the paper.

Results: Compression Savings and Training Speedup

The evaluation demonstrated significant reductions in memory and storage requirements while maintaining high-quality surrogate models. The key results are summarized in the table below.

Metric Improvement Context
Data storage reduction (simulation 1) Up to 23.7x Negligible impact on model quality
Data storage reduction (simulation 2) Up to 39x Negligible impact on model quality
Training time reduction Up to 3x Due to reduced data size and faster loading

"These results show that lossy compression saves data storage up to 23.7x and 39x with negligible impact on the quality of the surrogate model."

Additionally, reducing the size of the training data set enhances data loading speed, contributing to the overall training time reduction of up to 3x.

Implications for Enterprise AI

While the study focuses on scientific discovery simulations, the approach has direct relevance for enterprise AI applications that rely on large training datasets for neural surrogate models, such as digital twins in supply chain, logistics optimization, and manufacturing. The ability to cut storage requirements by nearly 40x and training time by 3x without sacrificing model fidelity can significantly lower infrastructure costs and accelerate model development cycles. For CTOs and technology leaders managing data-intensive AI pipelines, lossy compression, when carefully validated, offers a practical lever to scale generative surrogate modeling without proportional storage investment.

The researchers note that the method exploits the inherent variability in neural network training to estimate compression tolerance, suggesting that similar approaches could be generalized to other domains where training data volume is a bottleneck. As enterprises increasingly adopt surrogate models to replace costly simulations—whether for demand forecasting, route optimization, or equipment failure prediction—techniques that reduce the data footprint without compromising accuracy will become critical competitive differentiators.


Sources:

Keep Reading

Recommended Stories

Multiple Descents in Deep Learning Linked to Order-Chaos Transitions in LSTM Networks, New Research Shows Technology

Multiple Descents in Deep Learning Linked to Order-Chaos Transitions in LSTM Networks, New Research Shows

Researchers have observed a 'multiple-descent' phenomenon in LSTM networks, where test performance cycles through ups and downs after overtraining. Asymptotic stability analysis reveals these cycles are linked to order-chaos phase transitions, with the most optimal training step at the first transition from order to chaos, where the 'edge of chaos' is widest.

June 16, 2026
New Research Reveals Truthfulness Preserved Across LLM Lineages, Enabling Better Hallucination Control Technology

New Research Reveals Truthfulness Preserved Across LLM Lineages, Enabling Better Hallucination Control

A new paper from researchers shows that truthfulness-related attention heads are preserved across generations of large language models, even after instruction tuning or multimodal adaptation. The authors propose TruthProbe, a soft-gating strategy that amplifies these heads to reduce hallucinations, with improvements on HaluEval, POPE, and CHAIR benchmarks.

June 16, 2026
EEGNet Study Reveals Key Limitations in fNIRS Cognitive Load Classification Technology

EEGNet Study Reveals Key Limitations in fNIRS Cognitive Load Classification

A comprehensive study published on arXiv systematically evaluates EEGNet for classifying cognitive load from fNIRS signals. The research highlights critical challenges in generalization, achieving only 56.11% accuracy under subject-independent evaluation, and underscores the importance of segmentation strategy and learning rate selection.

June 16, 2026
LaWAM: Latent World Action Model Enables Efficient, Dynamics-Aware Robot Control with Low Latency Technology

LaWAM: Latent World Action Model Enables Efficient, Dynamics-Aware Robot Control with Low Latency

LaWAM (Latent World Action Model) is a new robotics AI that uses compact latent visual subgoals instead of full video generation to achieve fast, dynamics-aware robot control. It achieves state-of-the-art success rates on LIBERO (98.6%) and RoboTwin (91.22%) with 187ms per action-chunk and up to 24x lower latency than pixel-space World Action Models.

June 16, 2026