UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding

Researchers propose UniBrain, a unified multimodal large language model for brain MRI analysis that handles missing data through joint imputation and understanding. The model uses interleaved data flow, self-alignment, and dynamic hidden state mechanisms to achieve high performance on multi-disease MRI datasets.

iGEN Editorial

June 16, 2026

UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding

Multimodal large language models (MLLMs) hold great potential for medicine, as they inherit knowledge from LLMs and allow multiple data modalities to be integrated, analysed and interpreted in natural language, according to a new paper on arXiv. However, the field of medical MLLMs is constrained by non-trivial challenges, notably the scarcity of high-quality training data and the frequent occurrence of missing data in real-world clinical settings. To address these issues, researchers have proposed a novel unified multimodal model, UniBrain, for brain magnetic resonance image (MRI) analysis.

The Challenge of Missing Medical Data

In clinical practice, it is common to have incomplete sets of MRI modalities due to time constraints, patient condition, or equipment limitations. This missing data can hinder accurate diagnosis and analysis. Traditional approaches often require complete data or rely on imputation methods that are separate from the understanding task. UniBrain tackles this by employing a unified training strategy to perform joint imaging modality imputation and brain image understanding within a single model.

UniBrain: A Unified Approach

Named UniBrain, the model is designed for brain MRI analysis with a focus on robustness to modality incompleteness. During training, an interleaved and description-enriched data flow is constructed to train the model in an autoregressive manner, enabling medical reasoning with generated multimodal data. This allows the model to both fill in missing modalities and perform diagnostic tasks simultaneously.

Technical Innovations

UniBrain introduces several key techniques:

Self-alignment strategy: This approach leverages dense image embeddings to learn fine-grained anatomical features without requiring detailed image captions, reducing the need for expensive annotated data.
Dynamic hidden state mechanism: This mechanism alleviates exposure bias during long-context multimodal inference, improving the model's ability to handle extended sequences of medical data.

The model builds on multimodal large language model architecture, inheriting knowledge from LLMs to integrate and interpret multiple data modalities in natural language.

Performance on Multi-Disease Dataset

The researchers conducted extensive experiments on a multi-disease brain MRI dataset. Results demonstrate that UniBrain achieves high performance for brain image imputation, understanding, and disease diagnosis under various extents of modality incompleteness. The paper did not disclose specific numerical metrics, but the abstract states the model achieves high performance across all tasks.

The table below summarises the key features of UniBrain compared to traditional MLLM approaches for medical imaging:

Feature	Traditional MLLM Approaches	UniBrain
Handling missing modalities	Often require complete data	Joint imputation and understanding via unified training
Training data requirements	High-quality paired data	Self-alignment reduces need for detailed captions
Inference for long sequences	Susceptible to exposure bias	Dynamic hidden state mechanism mitigates bias
Overall task	Separate imputation or analysis	Unified reasoning with generated multimodal data

Implications for Enterprise AI

The UniBrain architecture demonstrates how MLLMs can be adapted to handle incomplete real-world data—a challenge that extends beyond healthcare into fields like logistics, finance, and supply chain management. While the current application is specific to brain MRI, the underlying techniques of joint imputation and understanding could inspire similar models for other domains where missing data is common. Enterprise technology leaders should monitor such advances as they may inform future AI systems capable of robust decision-making under uncertainty.

The authors of the paper are Song, Zhiyun; Liu, Che; Xia, Tian; Kori, Avinash; and Bai, Wenjia. The paper is available on arXiv under the identifier 2606.16484.

Sources:

UniBrain: A Unified Multimodal Model for Brain MRI Imputation and Understanding

The Challenge of Missing Medical Data

UniBrain: A Unified Approach

Technical Innovations

Performance on Multi-Disease Dataset

Implications for Enterprise AI

Recommended Stories

BrainG3N Tokenizer Enables Controllable 3D Brain MRI Generation with Clinical-Grade Embeddings

First Billion-Parameter Generative Foundation Model for Chest Radiography Achieves Expert-Level Synthesis Fidelity

Mutual Distillation of Dual Foundation Models Achieves State-of-the-Art PET/CT Segmentation with Only 5 Labeled Cases

Deep Learning Automates Doppler Angle Estimation in Ultrasound, Reducing Measurement Errors