Research Engineer, Machine Learning - Paris/London/Zurich/Warsaw

Mistral AI
City of London
4 weeks ago
Create job alert

About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimised, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet the needs of enterprises, whether on-premises or in cloud environments. Our offerings include Le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, the USA, the UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact.

Role Summary

The Research Engineering team spans Platform (shared infra & clean code) and Embedded (inside research squads). Engineers can move along the research-production spectrum as needs or interests evolve.

As a Research Engineer – ML track, you’ll build and optimise the large-scale learning systems that power our open-weight models. Working hand-in-hand with Research Scientists, you’ll either join:

  • Platform RE Team: Enhance the shared training framework, data pipelines and cluster tooling used by every team;
  • Embedded RE Team: Sit inside a research squad (Alignment, Pre-training, Multimodal, …) and turn fresh ideas into repeatable, scalable code.

Location: Paris / London (hybrid) or remote from EU/UK

What will you do
  • Accelerate researchers by taking on the heavy parts of large-scale ML pipelines and building robust tools.
  • Interface cutting-edge research with production: integrate checkpoints, streamline evaluation, and expose APIs.
  • Conduct experiments on the latest deep-learning techniques (sparsified 70 B + runs, distributed training on thousands of GPUs).
  • Design, implement and benchmark ML algorithms; write clear, efficient code in Python.
  • Deliver prototypes that become production-grade components for Le Chat and our enterprise API.
About you
  • Master’s or PhD in Computer Science (or equivalent proven track record).
  • 4 + years working on large-scale ML codebases.
  • Hands-on with PyTorch, JAX or TensorFlow; comfortable with distributed training (DeepSpeed / FSDP / SLURM / K8s).
  • Experience in deep learning, NLP or LLMs; bonus for CUDA or data-pipeline chops.
  • Strong software-design instincts: testing, code review, CI/CD.
  • Self-starter, low-ego, collaborative.
Benefits

France:

  • Competitive cash salary and equity
  • Food: Daily lunch vouchers
  • Sport: Monthly contribution to a Gympass subscription
  • Transportation: Monthly contribution to a mobility pass
  • Health: Full health insurance for you and your family
  • Parental: Generous parental leave policy
  • Visa sponsorship

UK:

  • Competitive cash salary and equity
  • Insurance
  • Transportation: Reimburse office parking charges, or £90 per month for public transport
  • Sport: £90 per month reimbursement for gym membership
  • Meal voucher: £200 monthly allowance for meals
  • Pension plan: SmartPension (percentages are 5% Employee & 3% Employer)


#J-18808-Ljbffr

Related Jobs

View all jobs

NLP/LLM Research Engineer — Prototyping Cutting-Edge AI (Hybrid)

NLP/LLM Research Scientist (PhD) – Hybrid, Cambridge

Senior Machine Learning Research Engineer

Machine Learning Research Engineer - NLP / LLM

Machine Learning Research Engineer – NLP / LLM

Machine Learning Research Engineer – NLP / LLM

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Jobs for Career Switchers in Their 30s, 40s & 50s (UK Reality Check)

Are you considering a career change into machine learning in your 30s, 40s or 50s? You’re not alone. In the UK, organisations across industries such as finance, healthcare, retail, government & technology are investing in machine learning to improve decisions, automate processes & unlock new insights. But with all the hype, it can be hard to tell which roles are real job opportunities and which are just buzzwords. This article gives you a practical, UK-focused reality check: which machine learning roles truly exist, what skills employers really hire for, how long retraining realistically takes, how to position your experience and whether age matters in your favour or not. Whether you come from analytics, engineering, operations, research, compliance or business strategy, there is a credible route into machine learning if you approach it strategically.

How to Write a Machine Learning Job Ad That Attracts the Right People

Machine learning now sits at the heart of many UK organisations, powering everything from recommendation engines and fraud detection to forecasting, automation and decision support. As adoption grows, so does demand for skilled machine learning professionals. Yet many employers struggle to attract the right candidates. Machine learning job adverts often generate high volumes of applications, but few applicants have the blend of modelling skill, engineering awareness and real-world experience the role actually requires. Meanwhile, strong machine learning engineers and scientists quietly avoid adverts that feel vague, inflated or confused. In most cases, the issue is not the talent market — it is the job advert itself. Machine learning professionals are analytical, technically rigorous and highly selective. A poorly written job ad signals unclear expectations and low ML maturity. A well-written one signals credibility, focus and a serious approach to applied machine learning. This guide explains how to write a machine learning job ad that attracts the right people, improves applicant quality and strengthens your employer brand.

Maths for Machine Learning Jobs: The Only Topics You Actually Need (& How to Learn Them)

Machine learning job adverts in the UK love vague phrases like “strong maths” or “solid fundamentals”. That can make the whole field feel gatekept especially if you are a career changer or a student who has not touched maths since A level. Here is the practical truth. For most roles on MachineLearningJobs.co.uk such as Machine Learning Engineer, Applied Scientist, Data Scientist, NLP Engineer, Computer Vision Engineer or MLOps Engineer with modelling responsibilities the maths you actually use is concentrated in four areas: Linear algebra essentials (vectors, matrices, projections, PCA intuition) Probability & statistics (uncertainty, metrics, sampling, base rates) Calculus essentials (derivatives, chain rule, gradients, backprop intuition) Basic optimisation (loss functions, gradient descent, regularisation, tuning) If you can do those four things well you can build models, debug training, evaluate properly, explain trade-offs & sound credible in interviews. This guide gives you a clear scope plus a six-week learning plan, portfolio projects & resources so you can learn with momentum rather than drowning in theory.