Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Researchers propose MuDuo, a mutual distillation framework that leverages two foundation models (SAM-Med3D for CT, SegAnyPET for PET) to distill knowledge into a lightweight student network for semi-supervised PET/CT segmentation. Achieving state-of-the-art performance on the AutoPET dataset with only 5 labeled cases, the approach eliminates manual prompts and maximizes unlabeled data utility.

iGEN Editorial

June 16, 2026

Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Organ segmentation from PET/CT is critical for quantitative analysis and radiotherapy planning in oncology, but the high cost of expert annotation limits the development of deep learning models. A team of researchers has proposed MuDuo, a mutual distillation framework that exploits both structural and functional foundation models to achieve state-of-the-art performance on the AutoPET dataset using only 5 labeled cases.

The Annotation Bottleneck in Medical Imaging

According to the research paper published on arXiv (arXiv:2606.15611), semi-supervised learning (SSL) provides a practical and effective solution for developing deep models with limited labeled data. Recent developments in visual foundation models have demonstrated remarkable adaptability with improved efficiency. The team's work bridges the gap between the task-specific precision of student models and the segmentation priors of generalist foundation models.

MuDuo: Mutual Distillation Framework

The proposed framework, MuDuo, synergistically leverages two modality-specific foundation models:

SAM-Med3D for structural CT imaging
SegAnyPET for metabolic PET imaging

Both act as generalists that distill their knowledge into a lightweight student network. The approach eliminates the need for manual prompts while maximizing the utility of unlabeled data for automatic segmentation.

Technical Details and Performance

The key innovation is mutual distillation: the two foundation models are used as teachers, each specializing in one modality, and the student network learns from both. The authors report state-of-the-art performance on the AutoPET dataset with only 5 labeled cases. The source code is publicly available at the project's GitHub repository.

Implications for Enterprise AI Adoption

While this work focuses on medical imaging, the concept of leveraging pre-trained foundation models through distillation to reduce labeled data requirements has broad applications. For enterprise technology leaders, the ability to deploy high-performance AI models with minimal annotated data translates directly into lower costs and faster time-to-value. The framework demonstrates that combining multiple large models as teachers can produce lightweight, efficient student models suitable for deployment in resource-constrained environments.

The research was conducted by Mao, Fuyou, Wu, Beining, Jiang, Yanfeng, Xu, Bohan, Lin, Lixin, Naye, Zhang, Hao, and Tang. The full paper is available under a CC BY 4.0 license on arXiv.

Sources:

Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

The Annotation Bottleneck in Medical Imaging

MuDuo: Mutual Distillation Framework

Technical Details and Performance

Implications for Enterprise AI Adoption

Recommended Stories

First Billion-Parameter Generative Foundation Model for Chest Radiography Achieves Expert-Level Synthesis Fidelity

BrainG3N Tokenizer Enables Controllable 3D Brain MRI Generation with Clinical-Grade Embeddings

UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding

Deep Learning Automates Doppler Angle Estimation in Ultrasound, Reducing Measurement Errors