Data Scientist at the BHF Data Science Centre (HDR UK) and Visiting Data Scientist at the University of Leeds.
I build tools and pipelines for cardiovascular and cancer research using large-scale NHS datasets, typically working with 50M+ patient records in trusted research environments.
- Healthcare data pipelines: PySpark, Databricks, R/dbplyr, SQL
- National Secure Data Environments: NHS England Secure Data Environment, SAIL in Wales
- LLM applications: RAG systems for clinical documentation and codebase intelligence
- Data curation tools: phenotype search, metadata management, validation frameworks
- Research software engineering: making analytical code reproducible and shareable
- Cardiovascular and oncology datasets for world-leading research
- Question-answering systems for healthcare documentation
Working from home in Leeds, UK



