Hello, I'm Colin
I'm what happens when the science, data and programming worlds collide! A passionate genetics and genomics scientist, with expertise across data analysis, statistics, machine learning, AI, natural language processing, algorithms and flutter.
Skills
Data Science & AI
- R & Python (Bioconductor packages, Pandas, NumPy...)
- Machine learning algorithms (caret)
- Natural language processing (SpaCy)
- Data visualization & interpretation (ggplot2)
- Statistical analysis
- Algorithms
App Development
- Dart, Flutter SDK
- State Management
- Firebase Integration
- Native Device Features
- Security, scalability and reliability
Tools & Others
- Git, GitHub/GitLab
- Docker
- VS Code, Android Studio
- Firebase
- Google Cloud
- API integration
About Me
I'm a dedicated and innovative problem solver. I can help build engaging web experiences and high-performance mobile applications using Flutter as the front end and a combination of Firebase and Google Cloud as the back end. This combination ensures a fast and beautiful appearing app to the users, and security, scalability and easy management for owners. My journey in technology is driven by my passion for leveraging technology to solve complex problems.
I have a PhD in Molecular and cellular biology and many years of experience in scientific research. As well as the science, I learned how to program, how to handle and analyse data appropriately, how to use AI and machine learning to enhance analyses.
I thrive in environments where I can apply my skills in data analysis, natural language processing, and machine learning to create intelligent, user-centric solutions. This portfolio showcases my work, demonstrating my capabilities and commitment to delivering solutions.
Please explore my page and feel free to reach out if you'd like to collaborate or discuss opportunities.

Work Experience
Full stack developer at NuFitPlan Nutrition and fitness service
Remote
- Developed a Flutter app with Firebase and Google Cloud backend.
- Android, iOS, web, and desktop versions of the app from a single codebase.
- Integrated with Stripe to process payments and subscriptions.
- Uses natural language processing (machine learning) and AI to enhance app functionality.
Scientific Editor at Fios Genomics
Edinburgh, UK
- Worked across genomics and transcriptomics (and other -omics) technologies and data sets.
- Applied appropriate statistical analyses and quality evaluation to various data sets.
- Programming primarily in the R language.
- Interpretation and discussion of results in context and how to use these to direct future work.
Postdoctoral researcher at Newcastle University
Newcastle-upon-Tyne, UK
- Worked on several projects which investigated the links between genetic variation in DNA, disease and treatments.
- First appreciated the power of computer programming.
My Flutter App Showcase
Here's a working example of one of my recent Flutter applications. This section aims to demonstrate my proficiency in mobile app development, UI/UX design, cross-platform capabilities and back-end integration ensuring flawless functionality, security and scalability. Please try it out!
NuFitPlan
Category: Nutrition and fitness
This app is designed to walk users through a complicated sign-up process. It handles user data, leverages Firebase to handle real-time data storage and retrieval, a custom NLP model for data processing, and Gemini AI for summarisation of information. It is integrated with Stripe to handle payments and subscriptions.
- Key feature 1: Fully functional, handles users, payments and subscriptions
- Key feature 2: Uses Google Cloud for computation and data analysis
- Key feature 3: Incorporates machine learning and Google Gemini AI
- For testing purposes, during the subscription process use card number "4242 4242 4242 4242", any future expiry date, any CVC number, any name and any valid UK postcode.
This app is a work in progress as I have a list of features to add, but I'm proud to share it with you in its current state.
Education
PhD Molecular and Cellular Biology
University of Glasgow | UK
Graduated: July 2014
BSc (Hons) Biochemistry
University of Glasgow | UK
Graduated: July 2010
Certificates
Finding Hidden Messages in DNA (Bioinformatics I)
A series of classes illustrating the power of computing in modern biology. Essentially it was an advanced R programming course where I learned to use scripting and automated processes to solve complex biological data problems.
Advanced Data Visualisation with R
Coursera certification. Included advanced figures with ggplot2, spatial data, plotly and gganimate.
R Programming
Coursera certification. I learned general programming and scripting with R, loop functions and debugging, simulation and profiling, and R markdown.
The Data Scientist’s Toolbox
Coursera certification. Here modules included data science fundamentals, R and RStudio, version control and Github, R markdown and big data.
Publications
Genetic risk of osteoarthritis operates during human skeletogenesis
Authors: SJ Rice, A Brumwell, J Falk, YS Kehayova, J Casement, E Parker, et al.
Journal: Human Molecular Genetics, 2023
Genetic risk of osteoarthritis operates during human fetal development
Authors: S Rice, A Brumwell, J Falk, Y Kehayova, J Casement, E Parker, I Hofer, et al.
Type: Report/Conference Abstract, 2022
Epigenomic analysis of osteoarthritis genetic risk during human fetal development
Authors: SJ Rice, J Falk, A Brumwell, Y Kehayova, J Casement, E Parker, IM Hofer, et al.
Journal: Osteoarthritis and Cartilage, 2022
Genome-wide association study identifies risk loci for progressive chronic lymphocytic leukemia
Authors: WY Lin, SE Fordham, N Sunter, C Elstob, T Rahman, E Willmore, et al.
Journal: Nature Communications, 2021
Functional testing of thousands of osteoarthritis-associated variants for regulatory activity
Authors: JC Klein, A Keith, SJ Rice, C Shepherd, V Agarwal, J Loughlin, et al.
Journal: Nature Communications, 2019
Prioritization of PLEC and GRINA as Osteoarthritis Risk Genes Through the Identification and Characterization of Novel Methylation Quantitative Trait Loci
Authors: SJ Rice, M Tselepi, AK Sorial, G Aubourg, C Shepherd, D Almarza, et al.
Journal: Arthritis & Rheumatology, 2019
Expression analysis of the osteoarthritis genetic susceptibility mapping to the matrix Gla protein gene MGP
Authors: C Shepherd, AE Reese, LN Reynard, J Loughlin
Journal: Arthritis Research & Therapy, 2019
Prioritization of PLEC and GRINA as osteoarthritis risk genes through the identification and characterization of novel methylation quantitative trait loci 2019
Authors: S Rice, G Aubourg, T Sorial, DA Gomez, M Tselepi, C Shepherd, et al.
Type: Report/Conference Abstract, 2019
Multicentre genome wide association study identifies risk alleles for progressive chronic lymphocytic leukaemia
Authors: D Allsup, WY Lin, SE Fordham, N Sunter, C Elstob, T Rahman, E Willmore, et al.
Journal: Blood, 2019
Functional Characterization of the Osteoarthritis Genetic Risk Residing at ALDH1A2 Identifies rs12915901 as a Key Target Variant
Authors: C Shepherd, D Zhu, AJ Skelton, J Combe, H Threadgold, L Zhu, et al.
Journal: Arthritis & Rheumatology, 2018
Genotype at the MGP OA risk locus correlates with an expression quantitative trait locus operating on MGP in cartilage and with dna methylation at a cluster of regulatory CpGs
Authors: C Shepherd, LN Reynard, J Loughlin
Journal: Osteoarthritis and Cartilage, 2018
Identification of novel methylation quantitative trait loci (mqtls) and functional characterization using CRISPR/CAS9 and gene expression analysis prioritizes Plec as an oa gene
Authors: SJ Rice, AK Sorial, G Aubourg, C Shepherd, M Tselepi, D Almarza, et al.
Journal: Osteoarthritis and Cartilage, 2018
Functional characterisation of the osteoarthritis genetic risk locus that resides at the gene ALDH1A2, coding for the retinoic acid synthesis enzyme RALDH2, identifies
Authors: C Shepherd, D Zhu, AJ Skelton, J Combe, LN Reynard, J Loughlin
Journal: Osteoarthritis and Cartilage, 2017
Expression analysis of the osteoarthritis genetic susceptibility locus mapping to an intron of the MCF2L gene and marked by the polymorphism rs11842874
Authors: C Shepherd, AJ Skelton, MD Rushton, LN Reynard, J Loughlin
Journal: Osteoarthritis and Cartilage, 2016
Methylation quantitative trait locus analysis of osteoarthritis links epigenetics with genetic risk
Authors: MD Rushton, LN Reynard, DA Young, C Shepherd, G Aubourg, F Gee, et al.
Journal: Human Molecular Genetics, 2015
Improved catenated structures of bovine peroxiredoxin III F190L reveal details of ring-ring interactions and a novel conformational state
Authors: Z Cao, DP McGow, C Shepherd, JG Lindsay
Journal: PLoS One, 2015
Methylation of cartilage DNA is a mediator of genetic risk at several OA susceptibility loci
Authors: MD Rushton, LN Reynard, DA Young, C Shepherd, R Darlay, HJ Cordell, et al.
Journal: Osteoarthritis and Cartilage, 2015
Inactivation of mammalian Ero1α is catalysed by specific protein disulfide-isomerases
Authors: C Shepherd, OBV Oka, NJ Bulleid
Journal: Biochemical Journal, 2014
Functional Characterization of the Osteoarthritis Genetic Risk Residing at ALDH1A2 Identifies rs12915901 as a Key Target Variant. Arthritis Rheumatol 70, 1577-1587 (2018).
Authors: C Shepherd, D Zhu, AJ Skelton, J Combe, H Threadgold, L Zhu, et al.
Journal: Nat Genet, 2014
The mechanism of endoplasmic reticulum oxidoreductase 1 α (Ero1α) inactivation
Authors: C Shepherd
Type: PhD Thesis, University of Glasgow, 2012
Get in Touch
I'm always open to discussing new projects, creative ideas, or opportunities to be part of your vision.