Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

ML Data Engineer

Recraft
City of London
1 week ago
Create job alert
About Us

Founded in the US in 2022 and now based in London, UK, Recraft is an AI tool for professional designers, illustrators, and marketers, setting a new standard for excellence in image generation.


We designed a tool that lets creators quickly generate and iterate original images, vector art, illustrations, icons, and 3D graphics with AI. Over 3 million users across 200 countries have produced hundreds of millions of images using Recraft, and we’re just getting started.


Join a universe of professional opportunities, develop and support large-scale projects, and shape the future of creativity. We are committed to making Recraft an essential, daily tool for every designer and setting the industry standard. Our mission is to ensure that creators can fully control their creative process with AI, providing them with innovative tools to turn ideas into reality.


If you’re passionate about pushing the boundaries of AI, we want you on board!


Job Description

At Recraft, we’re building the next generation of generative models across images and text. We’re looking for an ML Data Engineer to scale our data pipelines for unstructured data (primarily images) and keep our training flows fast, reliable, and repeatable. You’ll design and operate high-throughput ingestion and preprocessing on Kubernetes, evolve our internal data-pipeline framework, and work hand-in-hand with ML engineers to ship datasets that move model quality forward.


Key Responsibilities

  • Build robust crawlers/scrapers to collect large-scale image (and occasional text/HTML) datasets from diverse sources.


  • Own the end-to-end flow: raw data → quality/beauty/relevance filtering → dedup/validation → ready-to-train artifacts.
    Operate and improve our Kubernetes-based data-pipeline framework (distributed jobs, retries, monitoring, automation).


  • Work with S3-style object storage: efficient layouts, lifecycle, throughput, and cost awareness.


  • Add tooling around pipelines (progress/health visualization, metrics, alerts) for observability and faster iteration.


  • Collaborate closely with ML engineers to align datasets with training needs and accelerate experimentation.



Requirements

Must-have



  • Strong Python fundamentals; you write clean, maintainable, production-ready code.


  • Solid hands-on Kubernetes experience (containers, jobs, batch/distributed processing).


  • Proven track record with unstructured data, especially images (loading, filtering, transforming at scale).


  • Experience building web crawlers/parsers and handling real-world failure modes gracefully.


  • Comfort with S3/object storage and moving lots of data efficiently and safely.


  • Pragmatic, detail-oriented, ownership mindset; you enjoy making systems reliable and fast.



Nice-to-have



  • Familiarity with ML workflows (PyTorch) and downstream training considerations.


  • Experience with image quality scoring, captioning, or image-to-text pipelines.


  • DAG/workflow visualizations or pipeline UX tooling.


  • DevOps fluency: Docker, CI/CD, infra automation.



What We Offer

  • Competitive salary and equity.


  • We’re able to offer Skilled Worker visa sponsorship in the UK for qualified candidates.


  • Real impact on model quality: your pipelines directly power training runs and product improvements.


  • Ownership with support: autonomy to design and improve systems, alongside experienced ML peers.


  • Modern stack: Python, Kubernetes, S3, internal pipeline framework built for scale.


  • Growth: a fast-moving environment where shipping well-engineered systems is the norm.



#J-18808-Ljbffr

Related Jobs

View all jobs

AI/ML Data Engineer: Production Models on Azure

Senior AI/ML Data Engineer - 100% Remote - EMEA

Data Engineer

GCP AI/ML Data Engineer — Marketing Automation (London)

Sr. AI Data Engineer (UK Remote)

Google Cloud AI/ML Data Engineer

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK machine learning hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise shipped ML/LLM features, robust evaluation, observability, safety/governance, cost control and measurable business impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for ML engineers, applied scientists, LLM application engineers, ML platform/MLOps engineers and AI product managers. Who this is for: ML engineers, applied ML/LLM engineers, LLM/retrieval engineers, ML platform/MLOps/SRE, data scientists transitioning to production ML, AI product managers & tech‑lead candidates targeting roles in the UK.

Why Machine Learning Careers in the UK Are Becoming More Multidisciplinary

Machine learning (ML) has moved from research labs into mainstream UK businesses. From healthcare diagnostics to fraud detection, autonomous vehicles to recommendation engines, ML underpins critical services and consumer experiences. But the skillset required of today’s machine learning professionals is no longer purely technical. Employers increasingly seek multidisciplinary expertise: not only coding, algorithms & statistics, but also knowledge of law, ethics, psychology, linguistics & design. This article explores why UK machine learning careers are becoming more multidisciplinary, how these fields intersect with ML roles, and what both job-seekers & employers need to understand to succeed in a rapidly changing landscape.

Machine Learning Team Structures Explained: Who Does What in a Modern Machine Learning Department

Machine learning is now central to many advanced data-driven products and services across the UK. Whether you work in finance, healthcare, retail, autonomous vehicles, recommendation systems, robotics, or consumer applications, there’s a need for dedicated machine learning teams that can deliver models into production, maintain them, keep them secure, efficient, fair, and aligned with business objectives. If you’re hiring for or applying to ML roles via MachineLearningJobs.co.uk, this article will help you understand what roles are typically present in a mature machine learning department, how they collaborate through project lifecycles, what skills and qualifications UK employers look for, what the career paths and salaries are, current trends and challenges, and how to build an effective ML team.