Machine Learning Evaluation Engineer

writewithmarker

London

3 weeks ago

Applications closed


AI Evaluation, Research Methods, Python, LLM Observability

Salary range

£60,000-£80,000 p.a. + equity, depending on experience (up to £100,000 for candidates with exceptional relevant experience)

Apply

Email us at and tell us a little bit about yourself and your interest in the future of writing, along with your CV or a link to your CV site.

What is Marker?

Marker is an AI-native Word Processor – a reimagining of Google Docs and Microsoft Word.

Join us in building the next generation of agentic AI assistants supporting serious writers in their work.

We are a small, ambitious company using cutting-edge technology to give everybody writing superpowers.

What you'll do at Marker

We are looking for someone with a couple of years' experience in academia or industry who can help us bring rigour and insight to our AI systems through evaluation, research, and observability. You'll work directly with Ryan Bowman (CPO) to help us understand and improve how our AI assists writers. Here are some examples of areas you will be working in:

Design and implement evaluation frameworks for complex, subjective AI outputs (like writing feedback that's meant to inspire rather than just correct)
Build flexible evaluation pipelines that can assess quality across multiple dimensions - from human preference to actual writing improvement
Research and prototype new evaluation methodologies for creative and subjective AI tasks
Collaborate with our engineering team to integrate evaluation insights into our development process
Help define what "quality" means for different AI outputs and create metrics that actually matter for our users
Work on challenging problems like: "How do we automatically evaluate whether an AI comment successfully encourages thoughtful revision?"

What we can offer

A calm, human-friendly work environment among kind and experienced professionals
Fun, creative, novel, and interesting technical work at the intersection of AI research and product development
An opportunity to work with and learn about the latest advancements in AI evaluation and language models
Direct collaboration with leadership to shape how we understand and improve our AI systems
As much responsibility and as many growth opportunities as you want to take on

Are you a good fit for this role?

In order to be successful in this role, you will recognise yourself in the following:

You have experience with AI/ML evaluation methodologies and can speak the language of AI research
You've worked hands-on with language models and understand the challenges of evaluating subjective, creative outputs
You are a self-starter willing to work independently and at speed - we imagine a 2-week experiment cadence at most.
You are familiar with and have worked on related technical systems (evaluation pipelines, data collection tools) but don't need to be a full-stack engineer. You won't be expected to build these alone!
You think critically about what metrics actually matter and aren't satisfied with vanity metrics
You're comfortable working with ambiguous problems where the "right answer" isn't obvious
You have some programming experience (Python preferred) and can work independently on technical projects
You're interested in the intersection of AI capabilities and human creativity

An exceptional candidate for this role would be able to demonstrate some of the following:

Experience building evaluation systems for generative AI in production environments
Knowledge of TypeScript and ability to integrate with our existing systems
Background in human-computer interaction, computational creativity, or writing research
Experience with A/B testing, statistical analysis, and experimental design
Familiarity with modern AI observability and monitoring tools
Published research or deep interest in AI evaluation methodologies
Interest in writing (fiction, non-fiction, essays)

However, you are NOT expected to:

Be a senior software engineer - we're looking for someone who can build evaluation systems, not architect our entire backend
Have solved every evaluation problem before - this is cutting-edge work and we're figuring it out together
Be experienced with every library in our stack from day one - you'll work closely with Ryan and our engineering team
Have a specific degree - we value practical experience and research ability over credentials

Our stack

You'll be working with the following technologies:

Our AI engine uses a range of models, including self-hosted and fine-tuned open source models, as well as the latest reasoning models from Anthropic and OpenAI
Evaluation and research tools built primarily in Python, with integration into our TypeScript infrastructure
Our agentic AI execution platform is written in TypeScript, hosted on Cloudflare Workers
Standard ML tooling: various evaluation frameworks, data analysis tools, and monitoring systems
Our text editor frontend is a web application built with React, TypeScript and ProseMirror

Apply now!

Interested? Email us at with your CV (or a link to your CV site). Tell us a little bit about yourself and why you'd like to work at Marker!

Please note that this role is currently only available in our London hub, and at this time we are not able to sponsor work visas in the UK.

