Senior Software Engineer, MLOps and Infrastructure

Cohere
London
3 weeks ago
Applications closed

This job is brought to you by Jobs/Redefined, the UK's leading age-inclusive jobs board for the over-50s.

Who are we?

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future!

Why this team?

This team is responsible for building world-class infrastructure that is critical to all of Cohere's success. Stability, scalability, and observability are all paramount, as this work is the foundation for all members of technical staff.

Our team optimizes for a wide range of technical skillsets (some of which are outlined below). Being self-directed and adaptable, and able to identify and solve key problems, is essential.

Please note: All of our infrastructure roles require participating in a 24x7 on-call rotation, and you are compensated for your on-call schedule.

For this role, we are targeting candidates who live in EMEA.

In order to be successful in the role, you have:

  • 5+ years of engineering experience running production infrastructure at a large scale
  • Experience designing large, highly available distributed systems with Kubernetes, and GPU workloads on those clusters
  • Experience working with GCP, Azure, AWS and/or OCI
  • Experience in designing, deploying, supporting, and troubleshooting in complex Linux-based computing environments
  • Excellent collaboration and troubleshooting skills to build mission-critical systems, and ensure smooth operations and efficient teamwork
  • The grit and adaptability to solve complex technical challenges that evolve day to day

Bonus qualifications:

  • You have worked with or supported MLEs or data scientists
  • Familiarity with troubleshooting RDMA networking

As a Senior Software Engineer you will:

  • Build self-service systems that automate managing, deploying and operating services, including our custom Kubernetes operators that support language model deployments.
  • Automate environment observability and resilience. Enable all developers to troubleshoot and resolve problems.
  • Take steps required to ensure we hit defined SLOs, including participation in an on-call rotation.
  • Build strong relationships with internal developers and influence the Infrastructure team's roadmap based on their feedback.
  • Develop our team through knowledge sharing and an active review process.

You may be a good fit if:

  • You have proven production experience with Kubernetes.
  • You have hands-on coding experience developing services and automated tests (we use Go).
  • You prefer contributing to Open Source solutions rather than building solutions from the ground up.
  • You have experience scaling and debugging cloud-based infrastructure (we use Oracle, GCP, and CoreWeave).
  • You draw motivation from building systems that help others be more productive.
  • You see mentorship, knowledge transfer, and review as essential prerequisites for a healthy team.

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply! If you want to work really hard on a glorious mission with teammates who want the same thing, Cohere is the place for you.

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for 6 months for employees based in Canada, the US, and the UK
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, with offices in Toronto, New York, San Francisco and London, plus a co-working stipend
  • 6 weeks of vacation
