iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs GAS-Leak-LLM: Genetic Algorithm Jailbreaks Black-Box LLMs, Exposing Safety Gaps New Generative Recommendation Model HoloRec Uses Hierarchical Encoding and Interleaved Reasoning to Boost Accuracy Tensor-Coord: Algebraic Decomposition Enables Conflict-Free Multi-Agent LLM Planning Led by US, exits from gold ETFs continue for the 5th week in a row Domain-Guided Prompting Boosts Segment Anything Model for Seismic Interpretation Spokes Optimizes Diverse Pretraining Data Selection for LLMs, Boosting Performance Medical Heuristic Learning: LLM-Driven Framework for Interpretable Clinical Decision Rules Commodore Callback 8020 Brings Digital Detox With Modern Apps and Retro Design PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs
Home ›› Technology ›› Ai ›› Multiple Descents in Deep Learning Linked to Order-Chaos Transitions in LSTM Networks, New Research Shows

Multiple Descents in Deep Learning Linked to Order-Chaos Transitions in LSTM Networks, New Research Shows

Researchers have observed a 'multiple-descent' phenomenon in LSTM networks, where test performance cycles through ups and downs after overtraining. Asymptotic stability analysis reveals these cycles are linked to order-chaos phase transitions, with the most optimal training step at the first transition from order to chaos, where the 'edge of chaos' is widest.

iG
iGEN Editorial
June 16, 2026
Multiple Descents in Deep Learning Linked to Order-Chaos Transitions in LSTM Networks, New Research Shows

A new research paper published on arXiv presents a novel observation in deep learning: a 'multiple-descent' phenomenon during training of Long Short-Term Memory (LSTM) networks on a real-world task. According to the study by authors Wei, Wenbo, Xu, Fan, Le, Nicholas Chong Jia, Lai, Choy Heng, and Feng, Ling, the performance of the model—measured by loss function on test data—does not simply degrade after overtraining but instead goes through long cycles of up and down trends multiple times.

This finding challenges conventional expectations about model training and overfitting, offering potential insights for enterprise teams deploying AI in production.

Understanding the Multiple-Descent Phenomenon

The researchers carried out asymptotic stability analysis of the trained LSTM models. They discovered that the cycles in performance are closely associated with phase transitions between order and chaos within the model's dynamics. Specifically, local optimal training steps consistently occur at the critical transition point between the ordered and chaotic phases.

Phase Characteristics Performance Impact
Ordered Stable dynamics, low variability Typically lower test loss
Transition (Edge of Chaos) Critical boundary Local performance optimum
Chaotic Unstable dynamics, high sensitivity Performance may degrade

The paper highlights that the most optimal point of the model usually occurs at the first transition from order to chaos. At this stage, the 'width' of the 'edge of chaos' is often the widest, allowing the best exploration of weight configurations for learning.

Order-Chaos Transitions in Neural Networks

The concept of order-chaos transitions is not new in dynamical systems, but its direct linkage to the multiple-descent phenomenon in recurrent neural networks is a novel contribution. The researchers emphasize that the models undergo a phase transition process where the loss function's behavior on test data mirrors the underlying phase of the network's dynamics.

This suggests that optimal training points are not arbitrary but correspond to a specific dynamical regime. For practitioners training LSTMs, monitoring for the first transition could serve as a stopping criterion that yields the best generalization.

Implications for Enterprise AI Training

For enterprise technology leaders overseeing AI model development, these findings offer a framework for understanding why models sometimes exhibit unexpected performance swings after extended training. Rather than attributing fluctuations solely to noise or overfitting, the research points to a deterministic pattern rooted in the network's dynamics.

While the study focuses on LSTM networks, the authors note the multiple-descent behavior was observed during training on real-world tasks, suggesting practical relevance. Teams deploying LSTMs for sequence prediction—such as in demand forecasting, supply chain anomaly detection, or predictive maintenance—could benefit from analyzing model training steps relative to phase transitions.

The research, titled 'Multiple Descents in Deep Learning as a Sequence of Order-Chaos Transitions in LSTM Networks,' is available via arXiv under a Creative Commons license. It invites further exploration into how these dynamical phases can be harnessed to improve training efficiency and model performance.


Sources:

Keep Reading

Recommended Stories

DifFRACT Brings Circuit Tracing to Diffusion Transformers for Better AI Interpretability Technology

DifFRACT Brings Circuit Tracing to Diffusion Transformers for Better AI Interpretability

Researchers introduce DifFRACT, a method for mechanistic interpretability of multimodal diffusion transformers. By training timestep-conditioned transcoders on FLUX.1[schnell], they achieve exact feature-to-feature attribution and recover compact circuits, outperforming sparse autoencoders in precision.

June 16, 2026
Cortical Geometry and Wiring Serve as Powerful Inductive Biases for Recurrent Neural Networks Technology

Cortical Geometry and Wiring Serve as Powerful Inductive Biases for Recurrent Neural Networks

A new study leveraging the MICrONS functional connectomics dataset demonstrates that recurrent neural networks initialized with cortical geometry, wiring, and functional relationships consistently outperform baseline and partially constrained models across three decision-making tasks, achieving lower entropy and modular organization.

June 16, 2026
Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs Technology

Research Shows 'Retrieve, Don't Retrain' Approach Cuts AI Model Adaptation Costs

A new research paper from arXiv proposes a retrieval-augmented vision-language-action (VLA) policy that eliminates the need for per-task fine-tuning. By retrieving relevant demonstrations from a pool at test time, the frozen policy adapts to new tasks without updating model parameters. The method shows strong results on robotic manipulation benchmarks, including PushT and RoboTwin 2.0, and on a real robot.

June 16, 2026
AdaSTORM Breakthrough Scales LLM Reasoning to Thousand-Node Dynamic Graphs, Paves Way for Supply Chain AI Technology

AdaSTORM Breakthrough Scales LLM Reasoning to Thousand-Node Dynamic Graphs, Paves Way for Supply Chain AI

AdaSTORM, a new multi-agent AI framework, scales large language model reasoning to dynamic graphs of up to thousand nodes with over 90% accuracy. The approach uses adaptive partitioning and collaborative reasoning to overcome limitations of current LLMs, which can only handle tens of nodes. This breakthrough could enable AI-driven analysis of complex, evolving networks such as supply chains.

June 16, 2026