Artificial Intelligence #fastmix #data mixture
FastMix: Gradient-Based Data Mixture Optimization Reduces Search Cost in AI Training
FastMix is a framework that automates data mixture discovery by training a single proxy model and jointly optimizing the mixture coefficients and the model parameters via gradient descent. It reformulates mixture selection as a bilevel optimization problem, which makes the search efficient and scalable, and it is reported to outperform baseline methods.
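To make the bilevel idea concrete, here is a minimal toy sketch in pure Python. It is not FastMix's actual algorithm, which the summary does not specify. It assumes a one-parameter "model", a quadratic loss per data domain, a softmax parameterization of the mixture weights, and a one-step unrolled approximation of the outer gradient. All names (`optimize_mixture`, `centers`, `target`, `lr_model`, `lr_mix`) are hypothetical and chosen for illustration.

```python
import math


def softmax(logits):
    """Map unconstrained logits to mixture weights that sum to 1."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def optimize_mixture(centers, target, steps=2000, lr_model=0.1, lr_mix=1.0):
    """Jointly update a scalar model parameter and mixture logits.

    Assumed toy setup (not taken from the source):
      domain k training loss: 0.5 * (theta - centers[k]) ** 2
      validation loss:        0.5 * (theta - target) ** 2
    """
    theta = 0.0                    # model parameter (inner variable)
    alpha = [0.0] * len(centers)   # mixture logits (outer variable)

    for _ in range(steps):
        w = softmax(alpha)

        # Inner step: gradient descent on the mixture-weighted training loss.
        grads = [theta - c for c in centers]
        mixed_grad = sum(wk * gk for wk, gk in zip(w, grads))
        theta_next = theta - lr_model * mixed_grad

        # Outer step: differentiate the validation loss at theta_next with
        # respect to alpha through the single inner update (one-step unroll).
        # d theta_next / d alpha_j = -lr_model * w_j * (g_j - mixed_grad)
        val_grad = theta_next - target
        hypergrad = [
            val_grad * (-lr_model) * wk * (gk - mixed_grad)
            for wk, gk in zip(w, grads)
        ]
        alpha = [a - lr_mix * h for a, h in zip(alpha, hypergrad)]

        theta = theta_next

    return softmax(alpha), theta


# Three toy domains; the validation target lies closest to the third one.
weights, theta = optimize_mixture(centers=[-2.0, 0.0, 3.0], target=2.0)
print("mixture weights:", [round(x, 3) for x in weights])
print("model parameter:", round(theta, 3))
```

In this toy setting a single training run adjusts both the model and the mixture, so no separate proxy model is trained per candidate mixture. The mixture shifts toward the domain whose data best reduces validation loss. A realistic implementation would replace the scalar model with a neural network and would likely need a more careful hypergradient estimate.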
Jun 17, 2026 1 source