Generative modeling · Biomedical time series · 2026In progress

GlucoCast

Conditional diffusion framework for privacy-preserving blood glucose forecasting.

18%

RMSE improvement vs. LSTM / CNN baselines

OhioT1DM

Public T1D benchmark dataset

PyTorchDiffusion modelsOhioT1DM datasetConditional generation

Repository and writeup are coming online shortly. Reach out on email if you’d like early access.

Overview

GlucoCast is a conditional diffusion framework for continuous glucose monitoring (CGM) data. It generates synthetic CGM trajectories conditioned on contextual signals (meals, insulin, activity) and uses the generated data to train downstream forecasters that outperform LSTM and CNN baselines on the OhioT1DM benchmark by roughly 18% RMSE.

The motivating constraint is data sharing. Real CGM traces are sensitive health data, difficult to release at scale. A generator that produces plausible, privacy-preserving traces unlocks training pipelines that would otherwise be blocked on data access.

Background

Blood glucose forecasting helps people with type 1 diabetes anticipate hyper- and hypoglycaemic events. Supervised models (LSTM, CNN, Transformer) have shown promising results on public benchmarks but remain hard to deploy broadly because patient data is protected.

Diffusion models have emerged as a strong generative family for structured continuous data. Unlike GANs they have stable training dynamics; unlike VAEs they produce high-fidelity samples. Conditioning a diffusion model on the exogenous signals that drive glucose trajectories is a natural fit for this problem.

Approach

The generator learns to reverse a forward diffusion process on CGM windows, conditioned on aligned meal, insulin, and activity signals. Downstream forecasters are trained on mixtures of real and synthetic data, then evaluated against real held-out trajectories from OhioT1DM.

Results

Models trained with GlucoCast-augmented data outperform LSTM and CNN baselines trained on real data alone by approximately 18% RMSE on OhioT1DM. The improvement is consistent across the standard 30-minute and 60-minute forecast horizons used in the benchmark.

Tech stack

PyTorch: Diffusion model implementation and training.
Conditional diffusion: Score-based generation conditioned on meals, insulin, and activity.
OhioT1DM dataset: Public benchmark for type 1 diabetes CGM forecasting.