Data Scientist

Cell and Gene Therapy Catapult
Eastleigh
3 weeks ago
Create job alert

Application Deadline: 11 January 2026


Department: Scale Enabling Technologies


Location: London (Guy's hospital)


Description

By joining our dynamic Scale Enabling Technologies (SET) Team at the forefront of intelligent manufacturing for gene and cell therapies, the Data Scientist specialising in Machine Learning, Modelling, and AI will collaborate closely with multidisciplinary data and analytical scientists, as well as bioprocess engineers, to design, build, and deploy cutting‑edge predictive models and machine learning solutions. Contributions will directly accelerate the digital transformation of cell and gene therapy manufacturing, advancing our mission to deliver life‑changing treatments to patients worldwide.


The Data Scientist will own the development of robust data pipelines, execute advanced feature engineering, drive rigorous model validation and deployment, and assist with implementing decisions based on model output.


The Data Scientist will be empowered to extract actionable insights from complex datasets and build scalable, automated decision‑making systems, ensuring all analytical solutions adhere to best practices in statistical modelling and machine learning.


Comprehensive training and ongoing development opportunities will be provided to help you excel and grow with us. The role will interact with colleagues in Edinburgh as well as at the Stevenage and Braintree Manufacturing Innovation Centres, with occasional travel to these locations as required.


Key Responsibilities

  • Working with SET scientists to develop, validate, and deploy predictive models, digital twins, and ML algorithms for process optimisation and quality control
  • Build and maintain automated data pipelines for model training, validation and deployment
  • Conduct data analysis and statistical testing to identify process improvement opportunities
  • Evaluate both established and emerging AI/ML technologies for potential adoption
  • Communicate results through clear visualisations, dashboards, and reports both internally and externally via conference participation, publications, etc.
  • Maintain code documentation, version control, and reproducibility standards while driving continuous improvement initiatives
  • Collaborate with IT and infrastructure teams to ensure scalable and reliable deployment of solutions, as well as with the programme heads to execute on the strategic vision of the company

Experience

  • Strong programming experience in Python and R for data science and machine learning applications
  • Strong experience in developing algorithms for supervised and unsupervised learning using machine learning techniques and tools
  • Hands‑on experience with large language models (LLMs), AI agents, and code assistants to accelerate data analysis, automate workflows, and produce insights
  • Hands‑on expertise in building and validating data models and simulations, including predictive analytics for complex datasets
  • Experience applying data ontology and taxonomy principles to model, organize, and standardize biological and clinical datasets, ensuring semantic consistency and interoperability with industry standards (e.g., FAIR, ISA 95/88, OBO Foundry ontologies
  • Ambitious and highly motivated self‑starter who is passionate about pushing the boundaries of Industry 4.0 and making a tangible impact in the biotech sector.
  • Experience using code version control (e.g., git)

Desirable

  • Exposure to bioprocess data such as iPSC, AAV, CAR‑T
  • Experience in closed‑loop control strategies and their implementation
  • Working experience in time‑series, omics datasets analysis and knowledge graph architecture
  • Experience applying model validation strategies to ensure accuracy, reliability, and generalizability of predictive models
  • Working experience in ML‑specific frameworks (i.e. TensorFlow, PyTorch, SciPy, etc.)
  • Experience working with MATLAB

Skills, Knowledge & Expertise

  • A deep understanding of data structures, formats, manipulation, and best practices when storing and handling it within relational databases (e.g., PostgreSQL) or non‑relational databases (e.g., Redis)
  • Proven ability to apply deep learning techniques to high‑dimensional data, resulting in improved predictive accuracy and actionable insights
  • Able to engage constructively with colleagues at all levels and across different departments to deliver objectives and to respond to a wide range of customer and management needs
  • Has a positive attitude towards learning and personal and professional development
  • Keeps up to date with professional knowledge, expertise and best practice
  • Has a “roll your sleeves up” attitude towards varying work assignments
  • Proven diplomacy skills with diverse groups of internal and external stakeholders and able to build strong relationships
  • Resilient, with the ability to manage multiple and varied tasks and prioritise workload within a fast‑paced professional environment, while maintaining strong attention to detail
  • A positive attitude towards learning, personal and professional development
  • Keeps up to date with professional knowledge, expertise and best practice
  • Highly motivated, pragmatic and practical to support the mission of the Cell and Gene Therapy Catapult to accelerate the development of a commercial cell and gene‑based therapy industry in the UK
  • Desire to establish a high‑profile career within cell and gene therapy sector and the personal drive to help push the sector to be a commercial success
  • Knowledge of ATMPs, bioprocessing, digital twins, chemometric and mechanistic models (desirable)
  • Knowledge of data visualization tools (e.g., R Shiny, Plotly) (desirable)
  • Awareness of GxP standards, data integrity principles, and regulatory‑compliant data environments (desirable)
  • Familiarity with cloud computing platforms (AWS or similar) and working with APIs (desirable)
  • Understands innovation and the impact of disruptive technologies as well as drivers of change and new ways of working across industry, processes, people, and culture
  • A good team player with a hands‑on approach, and adaptable to new challenges
  • Lead and inspire colleagues
  • Ability to quickly establish credibility and build rapport and trust
  • Ambitious, collaborative, driven
  • Comfortable operating autonomously once goals and objectives are set
  • High degree of motivation, problem solving skills and innovative thinking
  • Excellent communication skills for cross‑disciplinary collaboration.
  • Self‑motivated, detail‑oriented, and adaptable in a fast‑paced R&D environment.
  • Proven ability to deliver high‑quality results with limited supervision.
  • A good team player easily adaptable to new challenges
  • Willingness to travel

Education / Qualifications

  • Relevant experience, including either a PhD or MSc in Computer Science, Engineering, Physics, Bioinformatics, or related STEM field

CGT Catapult is committed to providing an equal, diverse, and inclusive work environment where everyone’s contributions are valued. We celebrate differences, empower, and inspire everyone, because when everyone is included, everyone wins. In 2024, we received bronze accreditation from Inclusive Employers.


#J-18808-Ljbffr

Related Jobs

View all jobs

Data Scientist

Data Scientist

Data Scientist

Data Scientist

Data Scientist - New

Data Scientist - Contract - 12 months

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Write a Machine Learning Job Ad That Attracts the Right People

Machine learning now sits at the heart of many UK organisations, powering everything from recommendation engines and fraud detection to forecasting, automation and decision support. As adoption grows, so does demand for skilled machine learning professionals. Yet many employers struggle to attract the right candidates. Machine learning job adverts often generate high volumes of applications, but few applicants have the blend of modelling skill, engineering awareness and real-world experience the role actually requires. Meanwhile, strong machine learning engineers and scientists quietly avoid adverts that feel vague, inflated or confused. In most cases, the issue is not the talent market — it is the job advert itself. Machine learning professionals are analytical, technically rigorous and highly selective. A poorly written job ad signals unclear expectations and low ML maturity. A well-written one signals credibility, focus and a serious approach to applied machine learning. This guide explains how to write a machine learning job ad that attracts the right people, improves applicant quality and strengthens your employer brand.

Maths for Machine Learning Jobs: The Only Topics You Actually Need (& How to Learn Them)

Machine learning job adverts in the UK love vague phrases like “strong maths” or “solid fundamentals”. That can make the whole field feel gatekept especially if you are a career changer or a student who has not touched maths since A level. Here is the practical truth. For most roles on MachineLearningJobs.co.uk such as Machine Learning Engineer, Applied Scientist, Data Scientist, NLP Engineer, Computer Vision Engineer or MLOps Engineer with modelling responsibilities the maths you actually use is concentrated in four areas: Linear algebra essentials (vectors, matrices, projections, PCA intuition) Probability & statistics (uncertainty, metrics, sampling, base rates) Calculus essentials (derivatives, chain rule, gradients, backprop intuition) Basic optimisation (loss functions, gradient descent, regularisation, tuning) If you can do those four things well you can build models, debug training, evaluate properly, explain trade-offs & sound credible in interviews. This guide gives you a clear scope plus a six-week learning plan, portfolio projects & resources so you can learn with momentum rather than drowning in theory.

Neurodiversity in Machine Learning Careers: Turning Different Thinking into a Superpower

Machine learning is about more than just models & metrics. It’s about spotting patterns others miss, asking better questions, challenging assumptions & building systems that work reliably in the real world. That makes it a natural home for many neurodivergent people. If you live with ADHD, autism or dyslexia, you may have been told your brain is “too distracted”, “too literal” or “too disorganised” for a technical career. In reality, many of the traits that can make school or traditional offices hard are exactly the traits that make for excellent ML engineers, applied scientists & MLOps specialists. This guide is written for neurodivergent ML job seekers in the UK. We’ll explore: What neurodiversity means in a machine learning context How ADHD, autism & dyslexia strengths map to ML roles Practical workplace adjustments you can ask for under UK law How to talk about neurodivergence in applications & interviews By the end, you’ll have a clearer sense of where you might thrive in ML – & how to turn “different thinking” into a genuine career advantage.