Senior Data Scientist

Wellcome Sanger Institute
Hinxton
11 months ago
Applications closed

Do you want to help us improve human health and understand life on Earth? Make your mark by shaping the future to enable or deliver life-changing science to solve some of humanity’s greatest challenges.

Senior Data Scientist

We seek a senior machine learning research scientist to join a collaborative project between the Wellcome Sanger Institute and Open Targets. This project aims to leverage datasets generated internally at the Sanger Institute and publicly available data from human cells to create foundational models for biology, enhancing our understanding of life's rules and improving health for all. You will work within an interdisciplinary team of life scientists and computer/ML scientists, with a shared objective of advancing biological research through these foundational models. This role will sit within the AI/ML Faculty group led by Dr. Mohammad Lotfollahi, and the successful candidates, across different seniority levels (senior and principal), will be responsible for delivering their portfolio of scientific research projects as part of the broader team strategy.

About the role

Your role will involve designing foundational models leveraging multi-modal readouts. This includes integrating and processing data from various sources to develop robust and versatile AI models. To achieve this, you will work with open-source software, proposing, developing, and maintaining new solutions to analyze and interpret large-scale single-cell datasets. We have access to unique data and are also in the position to generate data to train unique models. Additionally, we have substantial computational power and GPU resources to train large models efficiently.

Our teams are well-positioned to tackle this problem with experience in both generating and analyzing datasets, including millions of cells across multiple tissues and conditions (e.g., disease, healthy). This involves a detailed understanding of the training of large-scale ML models and a track record of undertaking large data-science projects.

You will be responsible for:

  • Independently managing and leading machine learning research projects and writing up the outcomes as scientific publications for submission to journals or machine learning conferences (ICLR, ICML, CVPR, etc.).
  • Collaborating with team members in proposing, developing, and evaluating new machine learning models that enable understanding single-cell data and its application in drug discovery.
  • Working with Ph.D. students and postdocs in collaborating teams on developing solutions for interdisciplinary scientific problems in biology, as well as providing supervision and training to junior members of the team.
  • Contributing to writing scientific papers on biotechnology and biology.
  • Distilling your developed solutions into open-source and easy-to-install packages with documentation that facilitates the usage of your solution for downstream users, including biologists and bioinformaticians.
  • Presenting your research and analysis pipelines to internal and external audiences.

About You:

You will be supported in your personal and professional development and have the opportunity to lead peer-reviewed publications around using genetics and genomics approaches to guide drug discovery and present them at national and international conferences.

Essential Skills

● Ph.D. or M.Sc. with equivalent research experience in a relevant quantitative discipline (e.g., Computer Science, Computational Biology, Genetics, Bioinformatics, Physics, Engineering, or Applied Statistics/Mathematics)

● Previous ML work experience in a scientific/academic environment (RA positions and internships are considered work experience)

● Strong knowledge of Python, including core data science libraries such as Scikit-Learn, SciPy, TensorFlow, and PyTorch.

● Expertise in machine learning algorithms and frameworks, with experience in designing, training, and deploying ML models.

● Proficiency in handling and processing large datasets, including techniques for data cleaning, feature engineering, and data augmentation.

● Experience with high-performance computing environments, including the use of GPUs for training large-scale machine learning models.

● Experience in natural language processing (NLP) and training models based on transformer architectures, such as BERT and GPT.

● Familiarity with generative models such as diffusion models and flow matching.

● Knowledge of software development good practices and collaboration tools, including git-based version control, Python package management, and code reviews.

● Strong problem-solving skills with the ability to analyze complex data and derive actionable insights.

● Excellent communication skills, with the ability to explain complex machine learning algorithms and statistical methods to non-technical stakeholders.

  • Evidence of related work experience as a researcher in the area of machine learning
  • Strong publication record, ideally including first-author publications

In addition to the above technical skills, you will also have the following:

  • Ability to quickly understand scientific, technical, and process challenges and break down complex problems into actionable steps
  • Ability to work in a frequently changing environment with the capability to interpret management information to amend plans
  • Ability to prioritize, manage workload, and deliver agreed activities consistently on time
  • Good networking, influencing, and relationship-building skills
  • Strategic thinking and the ability to see the ‘bigger picture’
  • Ability to build collaborative working relationships with internal and external stakeholders at all levels
  • Demonstrates inclusivity and respect for all

Relevant publications from the group:

  • Lotfollahi, M., Naghipourfar, M., Luecken, M. D., Khajavi, M., Büttner, M., Wagenstetter, M., Avsec, Ž., Gayoso, A., Yosef, N., Interlandi, M. & others. Mapping single-cell data to reference atlases by transfer learning. Nature Biotechnology, 1–10.
  • Lotfollahi, M., Wolf, F. A. & Theis, F. J. scGen predicts single-cell perturbation responses. Nature Methods 16, 715–721.
  • Lotfollahi, M., Rybakov, S., Hrovatin, K., Hediyeh-Zadeh, S., Talavera-López, C., Misharin, A. V. & Theis, F. J. Biologically informed deep learning to query gene programs in single cell atlases. Nature Cell Biology.
