iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
Robot Learning Reveals Emergent 'Self' Subnetwork in Continual Learning Studies New Book on Optimal Transport Offers Machine Learning Practitioners a Unified Framework Lightweight Distillation of SAM 3 and DINOv3 for Edge-Deployable Livestock Monitoring Varanasi to Host 2-Day Wheat Products Promotion Society CEO's Conclave from July 9 Uncertainty Quality of VGGT: Analysis on DTU Benchmark Dataset Reveals Effective Confidence Threshold for 3D Reconstruction New Diffusion Model Learns Permutation Distributions with Softer, More Tractable Trajectories RidgeCut: Reinforcement Learning Framework Optimizes Logistics Network Partitioning with Rings and Wedges SDS-LoRA: New Low-Rank Adaptation Method Fixes Gradient Distortion in Large Model Fine-Tuning NeuronFabric Architecture Cuts Memory for On-Chip Transformer Training, Promises Efficient Edge AI Kharif Pulses Sowing Off to a Weak Start: Acreage Down 43% as of June 12 Robot Learning Reveals Emergent 'Self' Subnetwork in Continual Learning Studies New Book on Optimal Transport Offers Machine Learning Practitioners a Unified Framework Lightweight Distillation of SAM 3 and DINOv3 for Edge-Deployable Livestock Monitoring Varanasi to Host 2-Day Wheat Products Promotion Society CEO's Conclave from July 9 Uncertainty Quality of VGGT: Analysis on DTU Benchmark Dataset Reveals Effective Confidence Threshold for 3D Reconstruction New Diffusion Model Learns Permutation Distributions with Softer, More Tractable Trajectories RidgeCut: Reinforcement Learning Framework Optimizes Logistics Network Partitioning with Rings and Wedges SDS-LoRA: New Low-Rank Adaptation Method Fixes Gradient Distortion in Large Model Fine-Tuning NeuronFabric Architecture Cuts Memory for On-Chip Transformer Training, Promises Efficient Edge AI Kharif Pulses Sowing Off to a Weak Start: Acreage Down 43% as of June 12
Home ›› Technology ›› Ai ›› Llms ›› New UDS Framework Slashes LLM Fine-Tuning Time While Boosting Model Performance

New UDS Framework Slashes LLM Fine-Tuning Time While Boosting Model Performance

Researchers propose UDS (Utility-Diversity Sampling), a framework for efficient online batch selection during LLM supervised fine-tuning. UDS reduces training time compared to full-dataset fine-tuning while consistently outperforming state-of-the-art methods.

iG
iGEN Editorial
June 16, 2026
New UDS Framework Slashes LLM Fine-Tuning Time While Boosting Model Performance

Supervised fine-tuning (SFT) of large language models (LLMs) is a critical step for adapting models to downstream tasks, but it is computationally expensive and can suffer from overfitting or bias amplification when using the full dataset. Existing online batch selection methods that dynamically score and filter samples during training have limitations. They often rely solely on data utility, neglecting diversity, depend on external resources like reference models or validation sets, and incur extra training time over full-dataset training.

According to a new paper on arXiv by authors Heming Zou, Yixiu Mao, Yun Qu, Qi Wang, and Xiangyang Ji, a framework called UDS (Utility-Diversity Sampling) addresses these challenges. UDS leverages the nuclear norm of the logits matrix to capture both data utility and intra-sample diversity, while estimating inter-sample diversity through efficient low-dimensional embedding comparisons with a lightweight memory buffer of historical samples. This design eliminates the need for external resources and unnecessary backpropagation, securing computational efficiency.

"UDS consistently outperforms state-of-the-art online batch selection methods under varying data budgets, and significantly reduces training time compared to full-dataset fine-tuning." — from the paper's abstract.

Performance Gains

Experiments on multiple benchmarks demonstrate UDS's advantages. The framework achieves better model performance across different data budgets while requiring less computation than full-dataset SFT. Key benefits include:

  • Reduced training time: Compared to full-dataset fine-tuning, UDS significantly cuts training duration.
  • Improved model quality: Outperforms existing online batch selection methods consistently.
  • No external resources: Unlike prior work, UDS does not need a reference model or validation set.
  • Built-in diversity: Considers both inter- and intra-sample diversity, preventing overfitting and bias.

Comparison of Batch Selection Approaches

Feature Existing Methods UDS
Data utility used Yes Yes
Diversity considered Often neglected Both intra- and inter-sample
External resources required Reference model or validation set None
Extra training time over full dataset Yes No
Performance vs. SOTA Variable Consistently outperforms

Implications for Enterprise AI

For CTOs and technology leaders investing in LLM deployment, the UDS framework offers a practical route to reduce the cost and time of supervised fine-tuning. By automating the selection of the most valuable training examples, enterprises can achieve high-performance domain-adapted models without the expense of full-dataset processing. The code is available at the URL provided in the paper, enabling teams to integrate UDS into their existing SFT pipelines. This efficiency gain is critical as organizations scale their AI capabilities across applications such as supply chain optimization, trade document processing, and logistics automation, where quickly fine-tuning LLMs on proprietary data can yield competitive advantages.

The elimination of external dependencies also simplifies infrastructure requirements, aligning with lean IT strategies. As LLM adoption accelerates in enterprise contexts, frameworks like UDS that balance utility and diversity while minimizing computational overhead will become increasingly valuable.


Sources:

Keep Reading

Recommended Stories

New Self-Enhanced Fine-Tuning Method Boosts Text-to-SQL Reasoning and Generalization Technology

New Self-Enhanced Fine-Tuning Method Boosts Text-to-SQL Reasoning and Generalization

Researchers propose CoTE-SQL, a self-enhanced fine-tuning method that improves text-to-SQL generation by integrating reasoning traces, structured chain-of-thought prompting, and execution error correction. The approach achieves state-of-the-art results on Bird and Spider benchmarks, particularly on complex queries.

June 16, 2026
SDS-LoRA: New Low-Rank Adaptation Method Fixes Gradient Distortion in Large Model Fine-Tuning Technology

SDS-LoRA: New Low-Rank Adaptation Method Fixes Gradient Distortion in Large Model Fine-Tuning

A new paper on arXiv introduces SDS-LoRA, a low-rank parameterization that overcomes anisotropic gradient scaling in LoRA. By structurally decoupling singular values from the backward pass, SDS-LoRA ensures gradients are only applied through orthonormal bases, improving convergence and reducing the performance gap to full fine-tuning. Experimental results across natural language and vision benchmarks show enhanced adaptation performance.

June 16, 2026
NeuronFabric Architecture Cuts Memory for On-Chip Transformer Training, Promises Efficient Edge AI Technology

NeuronFabric Architecture Cuts Memory for On-Chip Transformer Training, Promises Efficient Edge AI

A new software reference architecture called NeuronFabric, detailed in an arXiv paper by Evgeny Ukladchikov, demonstrates on-chip transformer training with local Adam updates. The BF16W variant reduces memory requirements by approximately 16.5% compared to FP32, achieving 4.0 MB to 3.34 MB for a 334K-parameter model, enabling deployment on Xilinx ZCU102 devices. The C# prototype produces coherent text with loss comparable to an FP32 GPU reference.

June 16, 2026
Tyler Framework Boosts LLM Reasoning by Up to 14 Points with Smarter Compute Allocation Technology

Tyler Framework Boosts LLM Reasoning by Up to 14 Points with Smarter Compute Allocation

A new framework called Tyler introduces typed latent reasoning for large language models, learning when to invoke latent computation and how much to allocate. On three backbone LLMs, Tyler improved accuracy by up to 14.49 points over chain-of-thought prompting and up to 4.30 points over competing baselines, while reducing forgetting.

June 16, 2026