Clinical AI · EHR representation · research software

Chris Sainsbury

Clinician-researcher working on clinical foundation models, EHR representation learning, translational AI, and research software.

I work across medicine, machine learning, and research infrastructure. This site is a public front page for the main things I am working on: active projects, recent outputs, papers, software, and current directions.

GitHub profile shorthand: data analysis in diabetes · machine learning · sailing.

Translational research thread

Phenotype fingerprinting

Using clinical sequence models to identify rare, under-recognised, or hard-to-characterise disease patterns.

Read more

Research programme

ORCA

Treating electronic health records as a language, with implications for representation, translation, and generation.

Read more

Scientific-discovery system

NEXUS

Predicting cross-domain method transfer by learning from historical discoveries and validated transfers.

Read more

Programme-level research thread

ASCEND research arc

A linked sequence of work spanning ASCENDgpt, FlatASCEND, and ORCA.

Read more

Public-facing project

glucose.ai

A literature intelligence and discovery surface for diabetes and AI/ML work.

Read more

Research infrastructure

pi / cog workflows

Tools and workflows for keeping projects, sources, tasks, and synthesis organised in a usable working system.

Read more

recent outputs

Papers and scholarly outputs

2026

Sparse Autoencoder Decomposition of Clinical Sequence Model Representations: Feature Complexity, Task Specialisation, and Mortality Prediction

C Sainsbury, F Dong, A Karwath

arXiv preprint arXiv:2605.04072

2026 Cited by 1

FlatASCEND: Autoregressive Clinical Sequence Generation with Continuous Time Prediction and Association-Based Pharmacological Testing

C Sainsbury, F Dong, K Andreas

arXiv

2026

Implementation of a diabetes electronic health record with clinical decision support in primary care: A mixed methods study

J Dickson, S Cunningham, C Sainsbury, M Rutter, N Kanumilli, E Pearson, ...

Diabetic Medicine 43, 111-112

2026

DIABETES2VEC: Learning routine care trajectories to identify incident diabetes risk

C Sainsbury, G Jones, A Karwath

Diabetes Technology & Therapeutics 28, 140S-141S

2026

Causal and Active Learning-Based Counterfactual Chest X-ray Generation for Supporting Clinical Decision-Making in Lung Disease

Y Zhu, G Mohr, L Zhang, C Sainsbury, F Dong, JD Maclay, DJ Lowe, ...

Fifth Conference on Causal Learning and Reasoning

See all outputs

areas of work

What connects the portfolio

The portfolio spans academic papers, translational grants, scientific-discovery systems, and software for organising research. The aim is not just to build models, but to make them clinically meaningful and operationally usable.

Clinical foundation modelsEHR representation learningMechanistic interpretabilityTranslational AI in medicineScientific discovery systemsResearch software and agent workflows