Biomarker Data Science Platform

End-to-End Clinical Trial Biomarker Pipelines across Therapeutic Areas

Reproducible, modular biomarker analysis reports spanning four therapeutic areas. Each report applies an identical analytical pipeline to simulated Phase II trial data, with disease-specific biological framing and clinical interpretation.


Therapeutic Area Reports

Disease · Generalised Myasthenia Gravis (gMG)
Mechanism · FcRn-mediated IgG catabolism (VYVGART-like)
Primary biomarker · IgG total NPX
N · 120 patients

Disease · Non-Small Cell Lung Cancer (NSCLC)
Mechanism · PD-1 blockade / T-cell reinvigoration (nivolumab-like)
Primary biomarker · Tumour Mutational Burden (TMB)
N · 150 patients

Disease · Heterozygous Familial Hypercholesterolaemia (HeFH)
Mechanism · PCSK9 inhibition / LDL-R upregulation (evolocumab-like)
Primary biomarker · LDL Cholesterol (LDL-C)
N · 200 patients

Disease · Early Alzheimer’s Disease (MCI / mild dementia)
Mechanism · Anti-Aβ protofibril mAb (lecanemab-like)
Primary biomarker · Plasma p-tau181
N · 160 patients


Pipeline Architecture

All reports share a common three-stage analytical framework executed from a single parameterised source file (_analysis_core.qmd):

Stage Methods
Multi-Omics Transcriptomics QC & batch correction · Welch DE (BH-FDR) · Olink NPX proteomics · cross-modal correlation
Longitudinal & Survival Linear mixed-effects (lme4) · Emax PD model (nls) · Kaplan–Meier · Cox proportional-hazards
Machine Learning PCA + UMAP · k-means clustering · elastic-net (glmnet) · random forest + OOB AUROC

Built with Quarto · R 4.4.2 · 13 May 2026