Senior Machine Learning Engineer, Scaling and Performance

InstaDeep Ltd
London
1 month ago

Innovation is at the heart of what we do. We work as a cohesive team that collectively develops real-life decision-making and technology products across various industries. We are always on the lookout for talented minds to join our dynamic team and contribute their unique insights. Be part of a stimulating and collaborative environment where your ideas can make an impact and ignite transformative change worldwide.

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

The Team:

Our team plays a pivotal role in enhancing the capabilities and efficiency of our advanced AI systems. We design solutions that enable our machine learning models to scale seamlessly and perform optimally in real-world applications and large-scale research. Collaborating across InstaDeep, we directly impact projects in diverse fields including Life Sciences, Logistics, Chip Design, and Quantum ML.

The Role:

We seek a highly skilled Machine Learning Engineer with a passion for tackling the challenges of large-scale ML development. You'll play a vital role in making our ambitious AI solutions a practical reality. If you thrive on system-level analysis, find joy in squeezing every ounce of performance from hardware, and love diving deep into algorithm optimisation, this is the position for you.

Responsibilities

  • Scaling Expertise: Design and implement strategies to efficiently scale machine learning models across diverse hardware platforms (GPU/TPU).
  • Performance Optimisation: Analyse and profile ML systems under heavy load, pinpoint bottlenecks, and implement targeted optimisations.
  • Distributed Systems Architecture: Create robust distributed training and inference solutions for maximum computational efficiency.
  • Algorithmic Optimisation: Research and understand the latest deep learning literature to implement and optimise state-of-the-art algorithms and architectures, ensuring compute efficiency and performance.
  • Low-Level Mastery: Write high-quality Python, C/C++, XLA, Pallas, Triton, and/or CUDA code to achieve performance breakthroughs.

Required Skills

  • Understanding of Linux systems, performance analysis tools, and hardware optimisation techniques
  • Experience with distributed training frameworks (Ray, Dask, PyTorch Lightning, etc.)
  • Expertise with Python and/or C/C++
  • Development with machine learning frameworks (JAX, TensorFlow, PyTorch, etc.)
  • Passion for profiling, identifying bottlenecks, and delivering efficient solutions.

Highly Desirable

  • Track record of successfully scaling ML models.
  • Experience writing custom CUDA kernels or XLA operations.
  • Understanding of GPU/TPU architectures and their implications for efficient ML systems.
  • Solid grounding in the fundamentals of modern deep learning.
  • Actively following ML trends and a desire to push boundaries.

Example Projects:

  • Profile algorithm traces, identifying opportunities for custom XLA operations and CUDA kernel development.
  • Implement and apply SOTA architectures (Mamba, Griffin, Hyena) to research and applied projects.
  • Adapt algorithms for large-scale distributed architectures across HPC clusters.
  • Employ memory-efficient techniques within models for increased parameter counts and longer context lengths.

What We Offer:

  • Real-World Impact: Directly contribute to the performance and reach of our AI solutions.
  • Cutting-Edge Challenges: Tackle complex problems at the forefront of machine learning and large-scale system design.
  • Growth-Oriented Environment: Expand your expertise in a team of talented engineers dedicated to advancing ML scalability.

Our commitment to our people

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team?

We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work: Please note that you will require the legal right to work in the location you are applying for.

Ready to take the next step? Check out our FAQs and discover what makes us tick!

  • Can I apply to multiple jobs?
  • I was interviewed/applied last year and wasn't selected. May I reapply?
  • I don't live where the job opportunity is. Can I still apply?