iGEN
Visit IGEN World Explore IGEN Expo
EXPLORE UPGRADE PLANS
BREAKING
PISA Memory System Draws on Cognitive Psychology to Boost AI Agent Adaptability New Multi-Scale Two-Stream Framework Aims to Decouple Semantics from Distortions in AI-Generated Image Quality Assessment P3B3 Benchmark Reveals Strong Brazilian Portuguese Bias in Large Language Models Controlled Dynamics Attractor Transformer: New Model Targets Graph Anomaly Detection with Biologically Plausible Attention Tamil Nadu OE Spinning Mills Threaten 50% Production Cut Over High Cotton Waste Prices BridgePolicy: New Diffusion Bridge Method Improves Visuomotor Policy Learning in Robotics New Theory Explains How Deep Transformers Achieve Adaptive Inference Using Function Vectors PVminerLLM2 Uses Preference Optimization to Improve Structured Patient Voice Extraction Beyond Models: Reflections on Engineering AI-enabled Systems in a Project-Based Course AutoDojo: Adaptive Attacks Expose Superficial Defenses and Structural Limits in LLM Agents PISA Memory System Draws on Cognitive Psychology to Boost AI Agent Adaptability New Multi-Scale Two-Stream Framework Aims to Decouple Semantics from Distortions in AI-Generated Image Quality Assessment P3B3 Benchmark Reveals Strong Brazilian Portuguese Bias in Large Language Models Controlled Dynamics Attractor Transformer: New Model Targets Graph Anomaly Detection with Biologically Plausible Attention Tamil Nadu OE Spinning Mills Threaten 50% Production Cut Over High Cotton Waste Prices BridgePolicy: New Diffusion Bridge Method Improves Visuomotor Policy Learning in Robotics New Theory Explains How Deep Transformers Achieve Adaptive Inference Using Function Vectors PVminerLLM2 Uses Preference Optimization to Improve Structured Patient Voice Extraction Beyond Models: Reflections on Engineering AI-enabled Systems in a Project-Based Course AutoDojo: Adaptive Attacks Expose Superficial Defenses and Structural Limits in LLM Agents
Home ›› Technology ›› Ai ›› Computer Vision ›› Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Researchers propose MuDuo, a mutual distillation framework that leverages two foundation models (SAM-Med3D for CT, SegAnyPET for PET) to distill knowledge into a lightweight student network for semi-supervised PET/CT segmentation. Achieving state-of-the-art performance on the AutoPET dataset with only 5 labeled cases, the approach eliminates manual prompts and maximizes unlabeled data utility.

iG
iGEN Editorial
June 16, 2026
Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Organ segmentation from PET/CT is critical for quantitative analysis and radiotherapy planning in oncology, but the high cost of expert annotation limits the development of deep learning models. A team of researchers has proposed MuDuo, a mutual distillation framework that exploits both structural and functional foundation models to achieve state-of-the-art performance on the AutoPET dataset using only 5 labeled cases.

The Annotation Bottleneck in Medical Imaging

According to the research paper published on arXiv (arXiv:2606.15611), semi-supervised learning (SSL) provides a practical and effective solution for developing deep models with limited labeled data. Recent developments in visual foundation models have demonstrated remarkable adaptability with improved efficiency. The team's work bridges the gap between the task-specific precision of student models and the segmentation priors of generalist foundation models.

MuDuo: Mutual Distillation Framework

The proposed framework, MuDuo, synergistically leverages two modality-specific foundation models:

  • SAM-Med3D for structural CT imaging
  • SegAnyPET for metabolic PET imaging

Both act as generalists that distill their knowledge into a lightweight student network. The approach eliminates the need for manual prompts while maximizing the utility of unlabeled data for automatic segmentation.

Technical Details and Performance

The key innovation is mutual distillation: the two foundation models are used as teachers, each specializing in one modality, and the student network learns from both. The authors report state-of-the-art performance on the AutoPET dataset with only 5 labeled cases. The source code is publicly available at the project's GitHub repository.

Implications for Enterprise AI Adoption

While this work focuses on medical imaging, the concept of leveraging pre-trained foundation models through distillation to reduce labeled data requirements has broad applications. For enterprise technology leaders, the ability to deploy high-performance AI models with minimal annotated data translates directly into lower costs and faster time-to-value. The framework demonstrates that combining multiple large models as teachers can produce lightweight, efficient student models suitable for deployment in resource-constrained environments.

The research was conducted by Mao, Fuyou, Wu, Beining, Jiang, Yanfeng, Xu, Bohan, Lin, Lixin, Naye, Zhang, Hao, and Tang. The full paper is available under a CC BY 4.0 license on arXiv.


Sources:

Keep Reading

Recommended Stories

UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding Technology

UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding

Researchers propose UniBrain, a unified multimodal large language model for brain MRI analysis that handles missing data through joint imputation and understanding. The model uses interleaved data flow, self-alignment, and dynamic hidden state mechanisms to achieve high performance on multi-disease MRI datasets.

June 16, 2026
Deep Learning Automates Doppler Angle Estimation in Ultrasound, Reducing Measurement Errors Technology

Deep Learning Automates Doppler Angle Estimation in Ultrasound, Reducing Measurement Errors

A deep learning approach developed using 2100 carotid ultrasound images can automatically estimate Doppler angle, reducing error. The best model achieved mean absolute error less than clinical threshold, potentially improving blood velocity measurements.

June 16, 2026
GPU-Free AI Model UltraSeg Enables Real-Time Ultrasound Segmentation on CPUs Technology

GPU-Free AI Model UltraSeg Enables Real-Time Ultrasound Segmentation on CPUs

UltraSeg, an ultra-lightweight AI architecture, enables real-time point-of-care ultrasound segmentation without GPU dependency. Running on single-core CPUs at up to 89.7 FPS, it matches or exceeds larger models like UNet, making AI diagnostics viable in resource-limited settings.

June 16, 2026
New Sub-Semantic Image Segmentation Method DETECTURE Introduced by Researchers, Outperforms Baselines Technology

New Sub-Semantic Image Segmentation Method DETECTURE Introduced by Researchers, Outperforms Baselines

Researchers propose a new category of image segmentation called sub-semantic, which uses language to partition images into stable appearance patterns rather than whole objects. They introduce DETECTURE, a method that couples a vision-language model with SAM 3 to overcome three failure modes, and create a new dataset called TextureADE derived from ADE20K. DETECTURE achieves the strongest performance on several datasets compared to baselines.

June 16, 2026