National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Search - Search Inference - Senior MLOps Engineer

Elasticsearch B.V.
London
2 days ago
Create job alert

Elastic, the Search AI Company, enables everyone to find the answers they need in real time, using all their data, at scale — unleashing the potential of businesses and people. The Elastic Search AI Platform, used by more than 50% of the Fortune 500, brings together the precision of search and the intelligence of AI to enable everyone to accelerate the results that matter. By taking advantage of all structured and unstructured data — securing and protecting private information more effectively — Elastic’s complete, cloud-based solutions for search, security, and observability help organizations deliver on the promise of AI.
What is The Role The Search Inference team is responsible for bringing performant, ergonomic, and cost effective machine learning (ML) model inference to Search workflows. ML inference has become a crucial part of the modern search experience whether used for query understanding, semantic search, RAG, or any other GenAI use-case.
Our goal is to simplify ML inference in Search workflows by focusing on large scale inference capabilities for embeddings and reranking models that are available across the Elasticsearch user base. As a team, we are a collaborative, cross-functional group with backgrounds in information retrieval, natural language processing, and distributed systems. We work with Go microservices, Python, Ray Serve, Kubernetes/KubeRay, and work on AWS, GCP & Azure.
We provide thought leadership across a variety of mediums including open code repositories, publishing blogs, and speaking at conferences. We focus on matching the expectations of our customers along the lines of throughput, latency, and cost. We’re seeking an experienced ML Ops Engineer to help us deliver on this vision.
What You Will Be Doing Working with the team (and other teams) to evolve our inference service so it may host LLMs in addition to existing models (ELSER, E5, Rerank)
Enhancing the scalability and reliability of the service and work with the team to ensure knowledge is shared and best practices are followed
Improving the cost and efficiency of the platform, making the best use of available infrastructure
Adapting existing solutions to use our inference service, ensuring a seamless transition
What You Bring 5+ years working in an MLOps or related ML Engineering role
Production experience self-hosting & operating LLMs at scale for generative tasks via an inference framework such as Ray or KServe (or similar)
Production experience with running and tuning specialized hardware for Generative AI workloads, especially GPUs via CUDA
Measured and articulate written and spoken communication skills. You work well with others and can craft concise and expressive thoughts into correspondence: emails, issues, investigations, documentation, onboarding materials, and so on.
An interest in learning new tools, workflows and philosophies that can help you grow. You can function well in an environment that drives towards change. This role has tremendous opportunities for growth!
Please include whatever info you believe is relevant in your application: resume, GitHub profile, code samples, blog posts and writing samples, links to personal projects, etc.
Additional Information - We Take Care of Our People As a distributed company, diversity drives our identity. Whether you’re looking to launch a new career or grow an existing one, Elastic is the type of company where you can balance great work with great life. Your age is only a number. It doesn’t matter if you’re just out of college or your children are; we need you for what you can do.
We strive to have parity of benefits across regions and while regulations differ from place to place, we believe taking care of our people is the right thing to do.
Competitive pay based on the work you do here and not your previous salary
Health coverage for you and your family in many locations
Ability to craft your calendar with flexible locations and schedules for many roles
Generous number of vacation days each year
Increase your impact - We match up to $2000 (or local currency equivalent) for financial donations and service
Up to 40 hours each year to use toward volunteer projects you love
Embracing parenthood with minimum of 16 weeks of parental leave
Different people approach problems differently. We need that. Elastic is an equal opportunity employer and is committed to creating an inclusive culture that celebrates different perspectives, experiences, and backgrounds. Qualified applicants will receive consideration for employment without regard to race, ethnicity, color, religion, sex, pregnancy, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, disability status, or any other basis protected by federal, state or local law, ordinance or regulation.
We welcome individuals with disabilities and strive to create an accessible and inclusive experience for all individuals. To request an accommodation during the application or the recruiting process, please email .We will reply to your request within 24 business hours of submission.
Applicants have rights under Federal Employment Laws, view posters linked below: Family and Medical Leave Act (FMLA) Poster; Pay Transparency Nondiscrimination Provision Poster; Employee Polygraph Protection Act (EPPA) Poster and Know Your Rights (Poster)
Elasticsearch develops and distributes encryption software and technology that is subject to U.S. export controls and licensing requirements for individuals who are located in or are nationals of the following sanctioned countries and regions: Belarus, Cuba, Iran, North Korea, Russia, Syria, the Crimea Region of Ukraine, the Donetsk People’s Republic (“DNR”), and the Luhansk People’s Republic (“LNR”). If you are located in or are a national of one of the listed countries or regions, an export license may be required as a condition of your employment in this role. Please note that national origin and/or nationality do not affect eligibility for employment with Elastic.
Please see here for our Privacy Statement.

#J-18808-Ljbffr

Related Jobs

View all jobs

Project Manager

Project Manager

Power BI Data Analyst

Data Engineer

Flight Data Analyst...

Data Engineer...

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

How to Present Machine Learning Solutions to Non-Technical Audiences: A Public Speaking Guide for Job Seekers

Machine learning is driving change across nearly every industry—from retail and finance to health and logistics. But while the technology continues to evolve rapidly, the ability to communicate it clearly has become just as important as building the models themselves. Whether you're applying for a junior ML engineer role, a research position, or a client-facing AI consultant job, UK employers increasingly expect candidates to explain complex machine learning solutions to non-technical audiences. In this guide, you’ll learn how to confidently present your work, structure your message, use simple visuals, and explain the real-world value of machine learning in a way that makes sense to people without a background in data science.

Machine Learning Jobs UK 2025: 50 Companies Hiring Now

Bookmark this page—we refresh the Hotlist every quarter so you always know who’s really scaling their ML teams. The UK’s National AI Strategy, a £2 billion GenAI accelerator fund and a record flow of private capital have kicked ML hiring into overdrive for 2025. Whether you build production‑grade LLM services or optimise on‑device models for edge hardware, employers need your skills now. Below you’ll find 50 organisations that advertised UK‑based machine‑learning vacancies or announced head‑count growth during the past eight weeks. They’re grouped into five quick‑scan categories so you can jump straight to the type of employer—and mission—that excites you. For each company we list: Main UK hub Example live or recent vacancy Why it’s worth a look (stack, impact, culture) Search any employer on MachineLearningJobs.co.uk to see real‑time adverts, or set a free alert so fresh openings drop straight in your inbox.

Return-to-Work Pathways: Relaunch Your Machine Learning Career with Returnships, Flexible & Hybrid Roles

Returning to work after an extended break can feel like starting from scratch—especially in a specialist field like machine learning. Whether you paused your career for parenting, caring responsibilities or another life chapter, the UK’s machine learning sector now offers a variety of return-to-work pathways. From structured returnships to flexible and hybrid roles, these programmes recognise the transferable skills and resilience you’ve developed, pairing you with mentorship, upskilling and supportive networks to ease your transition back. In this guide, you’ll discover how to: Understand the current demand for machine learning talent in the UK Leverage your organisational, communication and analytical skills in ML contexts Overcome common re-entry challenges with practical solutions Refresh your technical knowledge through targeted learning Access returnship and re-entry programmes tailored to machine learning Find roles that fit around family commitments—whether flexible, hybrid or full-time Balance your career relaunch with caring responsibilities Master applications, interviews and networking specific to ML Learn from inspiring returner success stories Get answers to common questions in our FAQ section Whether you aim to return as an ML engineer, research scientist, MLOps specialist or data scientist with an ML focus, this article will map out the steps and resources you need to reignite your machine learning career.