Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Senior Machine Learning Engineer, Data for Embodied AI

Wayve
City of London
6 days ago
Create job alert
About us

Founded in 2017, Wayve is the leading developer of Embodied AI technology. Our advanced AI software and foundation models enable vehicles to perceive, understand, and navigate any complex environment, enhancing the usability and safety of automated driving systems.

Our vision is to create autonomy that propels the world forward. Our intelligent, mapless, and hardware-agnostic AI products are designed for automakers, accelerating the transition from assisted to automated driving.

In our fast-paced environment big problems ignite us—we embrace uncertainty, leaning into complex challenges to unlock groundbreaking solutions. We aim high and stay humble in our pursuit of excellence, constantly learning and evolving as we pave the way for a smarter, safer future.

At Wayve, your contributions matter. We value diversity, embrace new perspectives, and foster an inclusive work environment; we back each other to deliver impact.

Make Wayve the experience that defines your career!


The role

Science is the team that is advancing our end-to-end autonomous driving research. The team\'s mission is to accelerate our journey to AV2.0 and ensure the future success of Wayve by incubating and investing in new ideas that have the potential to become game-changing technological advances for the company.

The goal of this role is to build, scale, and optimise next-generation world model architectures (e.g. GAIA and successors) and bridge them into high-throughput training infrastructure, enabling synthetic data and simulation to dramatically accelerate autonomy development. You\'ll design systems to acquire, process, and curate multimodal data at scale. You\'ll turn raw experience into the high-quality datasets that fuel our models.

You\'ll sit at the intersection of machine learning research and data engineering, collaborating closely with scientists and infrastructure teams to ensure our workflows are robust, efficient, and deeply integrated with our model training stack.

Your work will directly impact how quickly and effectively we can train, evaluate, and deploy embodied AI systems in the real world.


Key responsibilities
  • Design and implement large-scale data acquisition, processing, and curation pipelines, owning the full lifecycle of high-quality datasets used to train advanced robotics and foundation models.
  • Continuously improve dataset quality and utility through sophisticated data analysis, debugging, and experimentation; developing metrics, tests, and monitoring mechanisms that directly drive model performance improvements.
  • Develop and scale multimodal data pipelines for ingestion, preprocessing, filtering, annotation, and storage across video, LiDAR, and telemetry modalities.
  • Run systematic experiments on data ablations and composition to assess their impact on model training dynamics, generalisation, and downstream performance.
  • Collaborate with ML researchers and platform engineers to ensure datasets are fit for purpose and efficiently integrated into large-scale training workflows.
  • Build internal tools and workflows for dataset auditing, visualization, and versioning to streamline iteration and reproducibility.
  • Advance best practices for data governance, reliability, and scalability across the data lifecycle; ensuring data safety, privacy, and long-term maintainability.

About you

To set you up for success as a Senior MLE at Wayve, we\'re looking for the following skills and experience:

  • 5+ years of experience in ML engineering, data engineering, or applied ML roles focused on large-scale data systems.
  • Proven experience building and maintaining large-scale data pipelines for machine learning, including data ingestion, transformation, and validation.
  • Strong Python fundamentals and experience with modern ML and data frameworks (e.g. PyTorch, Ray, Dask, Spark, or equivalent).
  • Solid understanding of multimodal data (video, lidar, sensor telemetry) and its challenges in large-scale training.
  • Experience defining and tracking data quality metrics, conducting dataset analysis, and driving data-informed improvements in model performance.
  • Demonstrated ability to work collaboratively with ML researchers, platform engineers, and product teams in a fast-paced, experimental environment.
  • Strong problem-solving skills, a data-driven mindset, and the ability to translate research needs into reliable data solutions.

Desirable
  • Exposure to large-scale storage, distributed training systems, or cloud compute environments (Azure, AWS, GCP).
  • Experience designing high-throughput, distributed data pipelines (e.g. with Spark, Ray, Beam, or similar frameworks).
  • Familiarity with data versioning, lineage, and governance tools (e.g. LakeFS, DVC, MLflow, Delta Lake).
  • Experience in AVs, robotics, simulation, or other embodied AI domains.
  • Familiarity with foundation models, generative models, or simulation-based data pipelines.

Why Join Us
  • Shape the future of embodied AI through data. Your work will directly determine the quality, scale, and impact of the foundation models that drive our autonomy systems.
  • Tackle data challenges at unprecedented scale. Work with petabytes of multimodal data - video, lidar, and telemetry - and build pipelines that enable training at the frontier of AI.
  • Collaborate with world-class talent. Partner with leading ML researchers, software engineers, and data scientists who are redefining how AI learns from real-world experience.
  • Make your mark on real-world autonomy. Your data systems will power models that see, understand, and act in the world.
  • Work in a high-trust, high-autonomy environment. We value creativity, experimentation, and rigorous thinking. You\'ll have the freedom to explore bold ideas and the support to make them real.

We understand that everyone has a unique set of skills and experiences and that not everyone will meet all of the requirements listed above. If you\'re passionate about self-driving cars and think you have what it takes to make a positive impact on the world, we encourage you to apply.

For more information visit Careers at Wayve.

To learn more about what drives us, visit Values at Wayve

DISCLAIMER: We will not ask about marriage or pregnancy, care responsibilities or disabilities in any of our job adverts or interviews. However, we do look to capture information about care responsibilities, and disabilities among other diversity information as part of an optional DEI Monitoring form to help us identify areas of improvement in our hiring process and ensure that the process is inclusive and non-discriminatory.


#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Machine Learning Engineer, Data for Embodied AI

Machine Learning Engineer - Evaluation

Senior Machine Learning Engineer, Scaling World Models

Senior Machine Learning Engineer, Scaling World Models

Staff Machine Learning Engineer - Autonomy

Machine Learning Engineer, Controllable GAIA

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK machine learning hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise shipped ML/LLM features, robust evaluation, observability, safety/governance, cost control and measurable business impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for ML engineers, applied scientists, LLM application engineers, ML platform/MLOps engineers and AI product managers. Who this is for: ML engineers, applied ML/LLM engineers, LLM/retrieval engineers, ML platform/MLOps/SRE, data scientists transitioning to production ML, AI product managers & tech‑lead candidates targeting roles in the UK.

Why Machine Learning Careers in the UK Are Becoming More Multidisciplinary

Machine learning (ML) has moved from research labs into mainstream UK businesses. From healthcare diagnostics to fraud detection, autonomous vehicles to recommendation engines, ML underpins critical services and consumer experiences. But the skillset required of today’s machine learning professionals is no longer purely technical. Employers increasingly seek multidisciplinary expertise: not only coding, algorithms & statistics, but also knowledge of law, ethics, psychology, linguistics & design. This article explores why UK machine learning careers are becoming more multidisciplinary, how these fields intersect with ML roles, and what both job-seekers & employers need to understand to succeed in a rapidly changing landscape.

Machine Learning Team Structures Explained: Who Does What in a Modern Machine Learning Department

Machine learning is now central to many advanced data-driven products and services across the UK. Whether you work in finance, healthcare, retail, autonomous vehicles, recommendation systems, robotics, or consumer applications, there’s a need for dedicated machine learning teams that can deliver models into production, maintain them, keep them secure, efficient, fair, and aligned with business objectives. If you’re hiring for or applying to ML roles via MachineLearningJobs.co.uk, this article will help you understand what roles are typically present in a mature machine learning department, how they collaborate through project lifecycles, what skills and qualifications UK employers look for, what the career paths and salaries are, current trends and challenges, and how to build an effective ML team.