Principal ML Platform Engineer

London, United Kingdom
Last month
Job Type
Permanent
Work Location
Remote
Seniority
Lead
Posted
8 Apr 2026 (Last month)

Synthesia is the world’s leading AI video platform for business, used by over 90% of the Fortune 100. Founded in 2017, the company is headquartered in London, with offices and teams across Europe and the US.

As AI continues to shape the way we live and work, Synthesia develops products to enhance visual communication and enterprise skill development, helping people work better and stay at the center of successful organizations.

Following our recent Series E funding round, where we raised $200 million, our valuation stands at $4 billion. Our total funding exceeds $530 million from premier investors including Accel, NVentures (Nvidia's VC arm), Kleiner Perkins, GV, and Evantic Capital, alongside the founders and operators of Stripe, Datadog, Miro, and Webflow.

We’re looking for a Principal Engineer to join the ML Platform team at Synthesia.

Our team builds and operates the systems that allow researchers and product teams totrain, serve, and deploy generative modelsreliably and efficiently. This includes research infrastructure, production serving systems, internal tooling, and the platform interfaces that connect them. A growing part of our mission is making these systems more automation-friendly andagent-oriented, so that workflows can increasingly be operated through reliable tooling rather than manual effort.

We’re looking for a strong generalist with a systems mindset:

  • someone who is comfortable working across infrastructure, backend systems, and tooling, and who has seen ML systems in practice.

  • this is not a pure ML Engineer role. We’re especially interested in people who think deeply about reliability, scalability, performance, and resource efficiency in complex production environments.

This is a hands-on IC role with significant ownership. You’ll help shape how our ML platform evolves as we scale the number of models, workloads, tools and teams relying on it.

What you’ll do

  • Design and improve the platform systems that support model training, evaluation, and production serving.

  • Build infrastructure and tooling that make ML workloads more reliable, scalable, and cost-efficient.

  • Develop internal tools and workflows that are easy to operateboth by humans and by agents.

  • Work on the architecture behind how models are deployed, served, and operated across research and product environments.

  • Improve how we schedule, monitor, and debug workloads running on GPUs and cloud infrastructure.

  • Develop internal tools and abstractions and agentic systems that reduce operational overhead for researchers and engineers.

  • Drive improvements across observability, automation, reliability, and developer experience.

  • Collaborate closely with researchers and product engineers to understand pain points and turn them into robust platform capabilities.

  • Contribute to technical direction and make pragmatic architectural tradeoffs as the platform grows.

You’ll thrive in this role if you have

  • Strong experience building or operating production systems with a focus on reliability, scalability, and maintainability.

  • A systems mindset: you naturally think in terms of bottlenecks, failure modes, interfaces, resource usage, and long-term operability.

  • Solid hands-on experience with cloud infrastructure, Linux, and infrastructure automation.

  • Experience with Kubernetes and operating distributed workloads in production.

  • Strong coding skills, ideally in Python or similar languages used for backend systems and tooling.

  • Strong judgment around where automation adds leverage, and where human control and reliability matter most.

  • Experience building internal platforms, developer tooling, or infrastructure abstractions used by other engineers.

  • Comfort working in ambiguous environments and taking ownership of open-ended technical problems.

  • A pragmatic approach: you care about solving the right problem well, not over-engineering.

Particularly relevant experience

  • Operating ML infrastructure or model serving systems in production.

  • Supporting research or data-intensive workloads.

  • Working with GPU-based systems or other performance-sensitive infrastructure.

  • Experience with observability and debugging in distributed systems.

  • Familiarity with Terraform, Datadog, GitHub Actions, or similar tools.

Bonus points for

  • Experience building agentic or LLM-powered internal tools.

  • Experience with workflow orchestration systems such as Temporal.

  • Experience working at the boundary between research and production engineering.

  • Familiarity with performance optimization, scheduling, or resource allocation problems.

  • Experience building lightweight product or developer-facing tools.

Related Jobs

View all jobs
Spotlight

Machine Learning Engineer - National Security (Gloucestershire)

Mind Foundry Gloucester, Gloucestershire, United Kingdom
On-site Clearance Required
Spotlight

Senior ML Compiler Engineer

Fractile Bristol, United Kingdom

Principal Machine Learning Infrastructure Engineer

PhysicsX London, United Kingdom

Senior Software Engineer, ML Ops

Isomorphic Labs London, United Kingdom
On-site

Group Product Manager

PolyAI London, United Kingdom

Principal Machine Learning Engineer (Live Sports Insights)

Sky Syon, London, United Kingdom
Hybrid

Principal Software Reliability Engineer - Consumer Identity

Entrust London, United Kingdom

Partner AI Deployment Engineer, Global Advisory Alliances

OpenAI United Kingdom
Hybrid

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Where to Advertise Machine Learning Jobs in the UK (2026 Guide)

Advertising machine learning jobs in the UK requires a different approach to most technical hiring. The candidate pool is small, highly specialised and in demand across AI labs, financial services, healthcare, autonomous systems and consumer technology simultaneously. Machine learning engineers and researchers move between roles through professional networks, conference communities and specialist platforms — not general job boards where ML roles compete with unrelated software engineering positions for the same audience. This guide, published by MachineLearningJobs.co.uk, covers where to advertise machine learning roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about hiring across different role types.

Machine Learning Jobs UK 2026: What to Expect Over the Next 3 Years

Machine learning has undergone a transformation that few technology disciplines can match. In the space of three years it has moved from a specialism sitting at the edges of most organisations' technology strategies to a capability that sits at the centre of them. The tools have changed, the expectations have shifted, and the range of industries treating machine learning as a core business function — rather than an experimental one — has expanded dramatically. For job seekers, this creates both opportunity and complexity in roughly equal measure. The machine learning jobs market of 2026 is significantly larger than it was three years ago, but it is also significantly more demanding. Employers have developed more sophisticated expectations, the technical bar for specialist roles has risen, and the landscape of tools, frameworks, and architectural patterns that practitioners are expected to know has broadened considerably. The candidates who will thrive over the next three years are those who understand where the discipline is heading — which specialisms are attracting the most investment, which technologies are reshaping what machine learning engineers and researchers are expected to build, and how the definition of a machine learning career is evolving beyond the model-building core toward a much wider range of roles across the full ML lifecycle. This article breaks down what the UK machine learning jobs market is likely to look like through to 2028 — covering the titles emerging right now, the technologies driving employer demand, the skills that will matter most, and how to position your career ahead of the curve.

New Machine Learning Employers to Watch in 2026: UK and Global Companies Driving ML Innovation

Machine learning (ML) has transitioned from a specialised field into a core business capability. In 2026, organisations across healthcare, finance, robotics, autonomous systems, natural language processing, and analytics are expanding their machine learning teams to build scalable intelligent products and services. For professionals exploring opportunities on www.MachineLearningJobs.co.uk , understanding the companies that are scaling, winning investment, or securing high‑impact contracts is crucial. This article highlights the new and high‑growth machine learning employers to watch in 2026, focusing on UK innovators, international firms with significant UK presence, and global platforms investing in machine learning talent locally.