CAP Achieves 87.6% Improvement in Respiratory Rate Prediction via Patient-Level PPG Learning

Researchers introduce Clinical Anchored Pretraining (CAP) for PPG signals, which anchors representations to patient-level clinical semantics. CAP outperforms baselines on four tasks, with a remarkable 87.6% relative improvement in respiratory rate prediction and average 26.7% gain across tasks.

iGEN Editorial

June 16, 2026

CAP Achieves 87.6% Improvement in Respiratory Rate Prediction via Patient-Level PPG Learning

Existing approaches to universal representation learning for photoplethysmography (PPG) — the optical signal used in wearable health monitors — have largely focused on signal-level objectives, such as waveform reconstruction, while neglecting the patient-level health context that is critical for clinical decision support. This gap limits generalization to complex clinical tasks and heterogeneous patient cohorts.

To address this, researchers from multiple institutions have constructed a large-scale paired PPG-electronic health record (EHR) multimodal dataset by distilling fragmented medical histories and clinical records into cohesive, patient-level EHRs. Building on this resource, they propose Clinical Anchored Pretraining for PPG (CAP), a method that performs cross-modal contrastive alignment to anchor PPG representations to patient-level clinical semantics. During pretraining, CAP guides the encoder beyond simple waveform fitting toward modeling consistency in a patient's overall physiological state. During downstream adaptation, the pretrained PPG encoder provides clinically grounded representations that strengthen inductive bias and improve robustness and transferability.

The research team evaluated CAP on four diverse downstream tasks and found that it consistently outperforms strong baselines across all tasks. Notably, CAP achieved a +87.6% relative improvement over the state-of-the-art baseline on respiratory rate prediction, and delivered an average relative improvement of +26.7% across all four tasks. The authors also enhanced interpretability through comprehensive analyses, including ablations and multiple complementary visualizations of the learned representations.

Downstream Task	Improvement vs. Baseline	Details
Respiratory Rate Prediction	+87.6% relative	State-of-the-art baseline outperformed
All Four Tasks (average)	+26.7% relative	Consistent gains across diverse tasks

The code for the experiments is available online. The study, titled "CAP: Towards PPG Universal Representation Learning with Patient-level Supervision," was authored by He, Chenyang, Shao, Xinyi, Huang, Shun, Bosong, Zhang, Daoqiang, Jing, Ming, and Ding, Cheng.

For enterprise technology leaders, CAP represents a significant step toward more accurate and context-aware health monitoring from wearable devices. By integrating patient-level clinical data during training, the method could enable more reliable remote patient monitoring, reduce false alarms, and improve clinical decision support systems. The large gains on respiratory rate prediction are particularly relevant for applications in chronic disease management, sleep tracking, and post-operative care, where respiratory metrics are critical. As wearable health devices become more prevalent in corporate wellness programs and telemedicine platforms, AI methods like CAP that improve signal-to-clinical inference could become a differentiator for health tech vendors.

Sources:

CAP Achieves 87.6% Improvement in Respiratory Rate Prediction via Patient-Level PPG Learning

Recommended Stories

Before the Labels: How Dataset Construction Biases Suicidality Detection in Clinical Text

Beijing Accuses US AI Firms of Using Chinese Models for Training

project44 CEO: AI Agents Without Context Are Just Guessing Faster

Self-Improving AI Isn't Just for Frontier Labs: How Enterprises Can Build Their Own