iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
Apple explains why Siri AI took so long: first version ready last year but rebuilt from ground up New LLM Framework Detects Phishing Emails with Over 90% Accuracy Dual-Granularity Orthogonal Disentanglement: New Framework Boosts Generalizable Audio Deepfake Detection Medical Image Segmentation Survey: U-Net, Transformers, SAM and Clinical Translation Challenges Bayesian Inference and Decision Audits Reveal Unreliability in Frontier AI Evaluation Archives Dali casualty exposes erosion of technical ownership in shipmanagement, warns veteran Kapoor SMEPilot Boosts LLM Inference Up to 3.94x on CPUs with Scalable Matrix Extensions Deep Learning Enables Autonomous Logistics Vehicles to Detect and Pick Load Carriers Bhumika Realty Appoints Amit Parsuramka as Chief Executive Officer New Automated Quantization Framework AQ4SViT Compresses Spiking Vision Transformers for Embedded AI Apple explains why Siri AI took so long: first version ready last year but rebuilt from ground up New LLM Framework Detects Phishing Emails with Over 90% Accuracy Dual-Granularity Orthogonal Disentanglement: New Framework Boosts Generalizable Audio Deepfake Detection Medical Image Segmentation Survey: U-Net, Transformers, SAM and Clinical Translation Challenges Bayesian Inference and Decision Audits Reveal Unreliability in Frontier AI Evaluation Archives Dali casualty exposes erosion of technical ownership in shipmanagement, warns veteran Kapoor SMEPilot Boosts LLM Inference Up to 3.94x on CPUs with Scalable Matrix Extensions Deep Learning Enables Autonomous Logistics Vehicles to Detect and Pick Load Carriers Bhumika Realty Appoints Amit Parsuramka as Chief Executive Officer New Automated Quantization Framework AQ4SViT Compresses Spiking Vision Transformers for Embedded AI
Home ›› Technology ›› Ai ›› Llms ›› PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks

PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks

A new method called PreLort addresses the challenge of aggregating federated LoRA adapters with different ranks due to heterogeneous hardware. By organizing adapter dimensions into a prefix hierarchy and introducing segment-wise aggregation and prefix-nested training, PreLort consistently outperforms existing heterogeneous federated LoRA methods in accuracy and ROUGE-L while achieving lower perplexity.

iG
iGEN Editorial
June 16, 2026
PreLort: Prefix-Nested LoRA Enables Federated Fine-Tuning Across Heterogeneous Hardware Ranks

Federated fine-tuning of large language models (LLMs) using parameter-efficient methods like LoRA (Low-Rank Adaptation) enables privacy-preserving adaptation of foundation models. However, heterogeneous hardware resources introduce a critical challenge: clients with different adapter ranks cannot be directly aggregated. Existing methods that allow aggregation under heterogeneous ranks fail to control how information is distributed across rank dimensions, leading to suboptimal use of shared low-rank representations. To solve this, researchers from multiple institutions have proposed PreLort, a nested low-rank formulation for federated LoRA that organizes adapter dimensions into a prefix hierarchy.

The Challenge of Heterogeneous Ranks

In federated learning, clients often possess different computational capabilities, resulting in varying adapter ranks when fine-tuning LLMs with LoRA. Direct averaging of these heterogeneous adapters dilutes the information contributed by lower-rank clients, as zero-padding disrupts the alignment of rank dimensions. According to the paper, existing heterogeneous federated LoRA methods do not control how information is distributed across rank dimensions, causing suboptimal use of shared low-rank representations. PreLort addresses this by ensuring that lower-rank dimensions encode task-relevant information while higher-rank dimensions capture additional capacity.

How PreLort Works

PreLort introduces three key components that together encourage a consistent low-rank prefix capturing the most task-relevant information, while higher-rank dimensions learn additional capacity. The first is a segment-wise aggregation rule that averages only over clients contributing to each rank segment, avoiding dilution from zero-padded lower-rank clients. The second is a prefix-nested training strategy that optimizes each adapter under multiple rank truncations, encouraging useful signal to concentrate in low-rank prefix dimensions. The third is the overall nested low-rank formulation that organizes adapter dimensions into a prefix hierarchy. These components allow low-rank clients to benefit from richer information contributed by higher-rank clients, as prefix dimensions are consistently learned and aggregated.

Component Description Benefit
Segment-wise aggregation Averages only over clients contributing to each rank segment Avoids dilution from zero-padded lower-rank clients
Prefix-nested training Optimizes each adapter under multiple rank truncations Encourages useful signal to concentrate in low-rank prefix dimensions
Nested low-rank formulation Organizes adapter dimensions into a prefix hierarchy Ensures lower-rank dimensions encode task-relevant information, higher-rank capture additional capacity

Experimental Results

Experiments conducted by the researchers demonstrate that PreLort consistently outperforms prior heterogeneous federated LoRA methods in accuracy and ROUGE-L, a metric for evaluating text generation quality. Additionally, the method achieves lower or comparable perplexity across multiple base models. The paper states that "our method consistently outperforms prior heterogeneous federated LoRA methods in accuracy and ROUGE-L, while achieving lower or comparable perplexity across multiple base models."

Implications for Enterprise AI

For enterprise technology decision-makers, PreLort represents a step toward more efficient and effective federated learning deployments. In scenarios where edge devices or regional servers have varying hardware capabilities—common in global supply chains and logistics—the ability to aggregate adapters without information loss can improve model performance without centralizing sensitive data. While the research is still in the academic phase, the method's focus on handling rank heterogeneity directly addresses a practical barrier to deploying federated LLM fine-tuning in heterogeneous environments.

The authors of the paper are Waseem, Muhammad, Tastan, Nurbek, Jovanovic, Andrej, Lane, Nicholas D, Lukas, Nils, Nandakumar, Karthik, and Horvath, Samuel. The work is available on arXiv and has been submitted to the computer science subcategory of Distributed, Parallel, and Cluster Computing.


Sources:

Keep Reading

Recommended Stories

SPRI: SVD-Partitioned Residual Initialization Boosts Data-Constrained MoE Upcycling for Multilingual Translation Technology

SPRI: SVD-Partitioned Residual Initialization Boosts Data-Constrained MoE Upcycling for Multilingual Translation

Researchers propose SPRI, a method that initializes Mixture-of-Experts (MoE) models from pretrained dense models using SVD-partitioned residuals. Evaluated on multilingual speech-to-text translation, SPRI achieves gains of 2.58 BLEU and 3.32 COMET over fine-tuned dense models, and outperforms prior MoE upcycling baselines by 3.39 BLEU and 4.34 COMET points.

June 16, 2026
Privacy-Preserving Text Sanitization for Distributed Agents via Disentangled Representations Technology

Privacy-Preserving Text Sanitization for Distributed Agents via Disentangled Representations

Researchers propose DiSan, a privacy-preserving text sanitization framework that uses disentangled representations to separate task semantics from style identifiers. Experiments show it reduces personally identifiable information exposure by 20 times while maintaining 83% answer faithfulness on a multi-agent RAG benchmark, outperforming token-level masking.

June 16, 2026
New AI Benchmark Reveals Brittle Reasoning in Large Language Models on Symbolic Puzzles Technology

New AI Benchmark Reveals Brittle Reasoning in Large Language Models on Symbolic Puzzles

Researchers introduce RecurrReason, a benchmark of 10,817 symbolic puzzles to test recurrent reasoning in sequence models. The study finds that T5-style encoder-decoder models significantly outperform GPT-2-style decoder-only models on most tasks, but all models score 0% on River Crossing puzzles. Architecture is a stronger determinant of success than scale, and pre-training only helps on puzzles with locally structured transitions.

June 16, 2026
Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models Technology

Who Should Lead Decoding Now? Tracking Reliable Trajectories for Ensembling Masked Diffusion Language Models

Masked Diffusion Language Models (MDLMs) have emerged as a distinct paradigm for sequence generation, but combining their knowledge is an underexplored problem. Researchers introduce TIE (Trajectory-based Iterative Ensembling), a framework that tracks confidence dynamics over answer-relevant positions to relay decoding trajectories between models, achieving strong performance on diverse reasoning tasks.

June 16, 2026