Shape the Future of AIJoin one of the UK's fastest-growing companies and become a Professional Development Expert in Artificial Intelligence.

View Roles

AI Data Engineering Lead

Moonvalley
London
1 month ago
Create job alert

Join to apply for theAI Data Engineering Leadrole atMoonvalley

Continue with Google Continue with Google

Join to apply for theAI Data Engineering Leadrole atMoonvalley

Get AI-powered advice on this job and more exclusive features.

Sign in to access AI-powered advices

Continue with Google Continue with Google

Continue with Google Continue with Google

Continue with Google Continue with Google

Continue with Google Continue with Google

Continue with Google Continue with Google

Continue with Google Continue with Google

This range is provided by Moonvalley. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.

Base pay range

$250,000.00/yr - $450,000.00/yr

Moonvalley is developing cutting-edge generative AI models designed to power Superbowl-worthy commercials and award-winning cinematic experiences. Our inaugural, cutting-edge HD model, Marey, is built on exclusively licensed and owned data for professional use in Hollywood and enterprise applications.

Our team is an unprecedented convergence of talent across industries. Our elite AI scientists from Deepmind, Google, Microsoft, Meta & Snap, have decades of collective experience in machine learning and computational creativity. We have also established the first AI-enabled movie studio in Hollywood, filled with accomplished filmmakers and visionary creative talent. We work with the top producers, actors, and filmmakers in Hollywood as well as creative-driven global brands. So far we’ve raised over $70M from world-class investors including General Catalyst, Bessemer, Khosla Ventures & YCombinator – and we’re just getting started.

Role Summary

We’re looking for a Data Engineering Lead to architect and scale the data pipelines that power our next-generation generative video models. This role is central to our mission of training models exclusively on clean, high-quality data.

You will lead the design of data ingestion pipelines, data annotations, and high-throughput, distributed systems that support large-scale data processing and curation. You’ll work closely with researchers, engineers, and infrastructure teams to ensure that our data pipeline is not just performant, but trusted, traceable, and aligned with our goal of building the world’s cleanest generative video foundation model.

What You'll Do

  • Design and lead scalable, high-throughput data pipelines optimized for multi-modal video model training.
  • Build systems for data ingestion, deduplication, quality assessment, validation, filtering, and labeling to ensure only clean, high-quality data flows through the pipeline.
  • Collaborate with research to define data quality benchmarks.
  • Optimize end-to-end performance across distributed data processing frameworks (e.g., Apache Spark, Ray, Airflow).
  • Work with infrastructure teams to scale pipelines across thousands of GPUs.
  • Work directly with the leadership on the data team roadmaps.
  • Manage the team of data engineers.
  • Work together with filmakers on data acquisition.

What We're Looking For

  • Deep experience in building and scaling data infrastructure for large-scale ML systems, ideally for video or multi-modal models.
  • Solid background in ML engineering, including hands-on experience in training and optimizing classifiers.
  • Experience managing large-scale datasets and pipelines in production.
  • Experience in managing and leading small teams of engineers.
  • Expertise in Python, Spark, Airflow, or similar data frameworks.
  • Understanding of modern infrastructure: Kubernetes, Terraform, object stores (e.g. S3, GCS), and distributed computing environments.
  • Strong communication and leadership skills; you can bridge the gap between engineering and research.
  • Skilled at balancing rapid, iterative delivery with a focus on long-term technical vision, ensuring solutions are both pragmatic and architecturally elegant.

Nice To Haves

  • Experience working on foundational model training pipelines (image, video, or language).
  • Familiarity with dataset licensing, governance, and compliance workflows.
  • Experience with video-specific data challenges like frame sampling, codec variability, temporal alignment, and perceptual quality scoring

In our team, we approach our work with the dedication similar to Olympic athletes. Anticipate occasional late nights and weekends dedicated to our mission. We understand this level of commitment may not suit everyone, and we openly communicate this expectation.

If you're motivated by deeply technical problems, a seemingly never-ending uphill battle and the opportunity to build (and own) a generational technology company, we can give you what you're looking for.

All business roles at Moonvalley are hybrid positions by default, with some fully remote depending on the job scope. We meet a few times every year, usually in London, UK or North America (LA, Toronto) as a company.

If you're excited about the opportunity to work on cutting-edge AI technology and help shape the future of media and entertainment, we encourage you to apply. We look forward to hearing from you!

The statements contained in this job description reflect general details as necessary to describe the principal functions of this job, the level of knowledge and skill typically required and the scope of responsibility. It should not be considered an all-inclusive listing of work requirements. Individuals may perform other duties as assigned, including work in other functional areas to cover absences, to equalize peak work periods, or to otherwise balance organizational work

Moonvalley AI is proud to be an equal opportunity employer. We are committed to providing accommodations. If you require accommodation, we will work with you to meet your needs.

Please be assured we'll treat any information you share with us with the utmost care, only use your information for recruitment purposes and will never sell it to other companies for marketing purposes. Please review our privacy policy and job applicant privacy policy locatedherefor further information.Seniority level

  • Seniority levelMid-Senior level

Employment type

  • Employment typeFull-time

Job function

  • Job functionInformation Technology
  • IndustriesSoftware Development

Referrals increase your chances of interviewing at Moonvalley by 2x

Sign in to set job alerts for “Data Specialist” roles.

Continue with Google Continue with Google

Continue with Google Continue with Google

London, England, United Kingdom 1 month ago

London, England, United Kingdom 6 days ago

London, England, United Kingdom 4 hours ago

London, England, United Kingdom 1 week ago

London, England, United Kingdom 16 hours ago

Lower Kingswood, England, United Kingdom
£22,925.00
-
£25,623.00
6 hours ago

London, England, United Kingdom 3 weeks ago

London, England, United Kingdom 2 weeks ago

Southwark, England, United Kingdom 4 days ago

London, England, United Kingdom 3 weeks ago

London, England, United Kingdom 1 month ago

Sutton, England, United Kingdom 1 week ago

London Area, United Kingdom 49 minutes ago

Digital, Data & Technology Graduate Programme – September 2025 start

London, England, United Kingdom 5 days ago

London, England, United Kingdom 1 week ago

London, England, United Kingdom 3 weeks ago

London Area, United Kingdom
£40.00
-
£50.00
8 hours ago

Digital, Data & Technology Graduate Programme

London, England, United Kingdom 3 days ago

Welwyn, England, United Kingdom 2 days ago

London, England, United Kingdom 1 day ago

London, England, United Kingdom 2 weeks ago

City Of London, England, United Kingdom 1 week ago

Hammersmith, England, United Kingdom 1 week ago

We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.


#J-18808-Ljbffr

Related Jobs

View all jobs

Senior Managing Data Engineer

Lead AI & Data Engineer

Data Engineering Manager (f/m/d)

AI Data Engineer

AI Data Engineer

Head of Enterprise Data Engineering - FCDO - G6...

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

10 Machine‑Learning Recruitment Agencies in the UK You Should Know (2025 Job‑Seeker Guide)

With deep‑learning projects now integral across healthcare, finance and tech, UK demand for machine‑learning talent is booming. Lightcast shows +50 % YoY growth in UK adverts referencing “machine learning,” “deep learning,” “computer vision” or “reinforcement learning” in Q1 2025. Monthly vacancies sit around 1,800–2,100, but certified ML specialists number fewer than 15,000. Specialist recruiters help candidates access hidden roles, competitive packages, and structured interview prep. How we screened: Only UK‑registered agencies with clear ML/AI or Data practices Agencies that posted ≥ 5 UK ML roles between March and June 2025

Machine Learning Jobs Skills Radar 2026: Emerging Tools, Frameworks & Platforms to Learn Now

Machine learning is no longer confined to academic research—it's embedded in how UK companies detect fraud, recommend content, automate processes & forecast risk. But with model complexity rising and LLMs transforming workflows, employers are demanding new skills from machine learning professionals. Welcome to the Machine Learning Jobs Skills Radar 2026—your annual guide to the top languages, frameworks, platforms & tools shaping machine learning roles in the UK. Whether you're an aspiring ML engineer or a mid-career data scientist, this radar shows what to learn now to stay job-ready in 2026.

How to Find Hidden Machine Learning Jobs in the UK Using Professional Bodies like BCS, Turing Society & More

Machine learning (ML) continues to transform sectors across the UK—from fintech and retail to healthtech and autonomous systems. But while the demand for ML engineers, researchers, and applied scientists is growing, many of the best opportunities are never posted on traditional job boards. So, where do you find them? The answer lies in professional bodies, academic-industry networks, and tight-knit ML communities. In this guide, we’ll show you how to uncover hidden machine learning jobs in the UK by engaging with groups like the BCS (The Chartered Institute for IT), Turing Society, Alan Turing Institute, and others. We’ll explore how to use member directories, CPD events, SIGs (Special Interest Groups), and community projects to build connections, gain early access to job leads, and raise your professional profile in the ML ecosystem.