Research Engineer - Data

Leonardo.Ai
London
1 week ago
Create job alert

Leonardo.Ai, now part of the Canva family, is on a mission to redefine creativity through cutting-edge generative AI. Our platform empowers millions worldwide to effortlessly produce high-quality images, videos, and more. With nearly a quarter of a billion users, we’re building a world-class R&D team to push the boundaries of AI creativity.

The Role:

As aResearch Engineer – DataatLeonardo, you will architect and manage petascale data pipelines, combining text, images, 3D models, and other data modalities to drive world-class AI models. You’ll work hand-in-hand with our Researchers to create and curate large, multi-modal datasets, including synthetic data, that supercharge SOTA generative AI solutions. Your expertise in distributed systems, data processing, and experimentation will shape the backbone of our research work.

Responsibilities:

  • Data Acquisition & Curation
    Lead the ingestion, unification, and organization of large, unstructured data sources (e.g., text, images, 3D geometry, code snippets) into scalable, high-quality datasets suitable for machine learning research and production.

  • High-Performance Data Pipelines
    Develop and optimize distributed systems for data processing, including filtering, indexing, and retrieval, leveraging frameworks like Ray, Metaflow, Spark, or Hadoop.

  • Synthetic Data Generation
    Build and orchestrate pipelines to generate synthetic data at scale, advancing research on cost-efficient inference and training strategies.

  • Experiments & Analysis
    Design and conduct experiments on dataset quality, scalability, and performance.

  • Security & Compliance
    Collaborate with legal and safety teams to ensure all data usage respects privacy, security, and ethical standards.

  • Open-Source Contributions
    Contribute to internal and external libraries or frameworks, sharing insights and breakthroughs with the wider AI community through publications or technical blogs.

Skills we like you to have:

  • Multi-Modal Data Expertise
    Hands-on experience with images, videos, 3D geometry (mesh/solid modeling), and/or text data. Well-rounded expertise in Python and PyTorch.

  • Synthetic Data & Inference
    Passion for synthetic data generation making use of inference of pretrained models, 3D rendering engines, and/or other softwares.

  • Distributed Computing & MLOps
    Demonstrated proficiency in setting up large-scale, robust data pipelines, using frameworks like Spark, Ray, or Metaflow. Comfortable with model versioning, and experiment tracking.

  • Performance Optimization
    Good understanding of parallel and distributed computing. Experienced with setting up evaluation methods

  • Cloud & Storage Systems
    Experience with AWS, Azure, or other cloud platforms. Proficient in both relational (MySQL, PostgreSQL) and NoSQL (MongoDB, Cassandra) databases, plus vector data stores.

Our Culture

  • Inclusive Culture:We celebrate diversity and are committed to creating an inclusive environment where everyone feels valued and empowered. Your unique perspectives and experiences are essential to our success.

  • Flexible Work Environment:We understand the importance of work-life balance. Thrive personally and professionally with the option to work remotely or in our vibrant offices.

  • Empowering Growth:We invest in your development with continuous learning opportunities and clear pathways for career advancement tailored to your goals.

  • Meaningful Impact:Be part of shaping the future of AI and contribute to innovative projects with global impact.

If you’re passionate about building scalable data ecosystems that fuel the next frontier of AI innovation—and you’re excited to collaborate with top-tier researchers and engineers—join us atLeonardo.Aito make creativity boundless for everyone.

#J-18808-Ljbffr

Related Jobs

View all jobs

Research Engineer

Research Engineer - Chem Bio

Research Engineer, ML, AI & Computer Vision

Research Engineer

Research Engineer, Multimodal

Research Engineer, ML, AI & Computer Vision

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Leadership for Managers: Strategies to Motivate, Mentor, and Set Realistic Goals in Data-Driven Teams

Machine learning (ML) has become an indispensable force in the modern business world, influencing everything from targeted marketing campaigns to advanced medical diagnostics. As industries integrate predictive algorithms and data-driven decision-making into their core operations, the need for effective leadership in machine learning environments has never been greater. Whether you’re overseeing a small team of data scientists or spearheading an enterprise-scale ML project, your leadership style must accommodate rapid innovation, complex problem-solving, and diverse stakeholder expectations. This guide provides actionable insights into how you can motivate, mentor, and establish achievable goals for your machine learning teams—ensuring they thrive in data-driven environments.

Top 10 Books to Advance Your Machine Learning Career in the UK

Machine learning (ML) remains one of the fastest-growing fields within technology, reshaping industries across the UK from finance and healthcare to e-commerce, telecommunications, and beyond. With increasing demand for ML specialists, job seekers who continually update their knowledge and skills hold a significant advantage. In this article, we've curated ten essential books every machine learning professional or aspiring ML engineer in the UK should read. Covering foundational theory, practical implementations, advanced techniques, and industry trends, these resources will equip you to excel in your machine learning career.

Navigating Machine Learning Career Fairs Like a Pro: Preparing Your Pitch, Questions to Ask, and Follow-Up Strategies to Stand Out

Machine learning (ML) has swiftly become one of the most in-demand skill areas across industries, with companies leveraging predictive models and data-driven insights to solve challenges in healthcare, finance, retail, manufacturing, and beyond. Whether you’re an early-career data scientist aiming to break into ML, a seasoned engineer branching into deep learning, or a product manager exploring AI-driven solutions, machine learning career fairs offer a powerful route to connect with prospective employers face-to-face. Attending these events can help you: Network with hiring managers and technical leads who make direct recruitment decisions. Gain insider insights on the latest ML trends and tools. Learn about emerging job roles and new industry verticals adopting machine learning. Showcase your interpersonal and communication skills, both of which are increasingly important in collaborative AI/ML environments. However, with many applicants vying for attention in a bustling hall, standing out isn’t always easy. In this detailed guide, we’ll walk you through how to prepare meticulously, pitch yourself confidently, ask relevant questions, and follow up effectively to land the machine learning opportunity that aligns with your ambitions.