Senior Platform Engineer (Infrastructure)

uSwitch

London

8 months ago

Applications closed

Related Jobs

View all jobs

Senior Data Engineer [UAE Based] (London Area)

Senior Data Engineer [UAE Based]

Senior Data Scientist

Senior Data Scientist (MLOps)

Senior Data Science Consultant – Econometrics specialist

User Experience Researcher

Description

Hybrid - 2 days per week in office (London Bridge/Tower Bridge area)

The RVU London cloud infrastructure team

We are committed to Open Source software in order to build services that help millions of customers to save money and make confident decisions. As well as helping our customers, we also give back to the community by open sourcing interesting projects that we build that might benefit others.

We’re looking for an experienced Platform/Infrastructure Engineer to join our infrastructure platform team, known internally as ‘Airship’.

Our goal as a team is to enable our development teams to deliver services quickly, reliably and securely. We do this by running multiple Kubernetes EKS and Fargate clusters in AWS, creating common tooling to aid in development tasks and running shared services such as Opensearch, Envoy, Vault and Prometheus to name a few. The team has also recently expanded its scope to simplify Data engineering in the organisation using the same techniques we used to ease creating web applications on data pipelines, leveraging Argo Workflows and Argo Events as well as completed a migration to Github Actions.

Day to day tasks will include:

Planning and working on our infrastructure platform: from maintenance to design systems improvements or to adopt new technologies
Working with product engineering and data teams to design, build and improve scalability and reliability of their systems with an emphasis to provide the best DevEx
Developing tooling to help our teams work more efficiently

Requirements

The ideal candidate will have some of the following skills:

Extensive experience in running Kubernetes clusters in production
Knowledge of Golang, Helm and Terraform (some knowledge of Python is definitely a plus)
Production experience in Cilium and/or eBPF and networking in general
Extensive experience in monitoring systems and their performance
The ability to debug large and complex systems and solving large problems that affect a wide user base in a simple way
Experience with image vulnerability scanning and patching strategies for large systems
Experience / Familiarity with AWS Multi Accounts system designs tools like Crossplane and Control Tower
Familiarity with Argo Workflows or similar data pipeline as a service tools
Familiarity working with a variety of Cloud Native projects
Familiarity with Github Action
Familiarity with OpenTelemetry

Out team has been featured in a few conferences:

CNCF:

PlatformCon: and

We have also been featured in the London AWS Summit 2023 for contribution to the EKS tooling community

We also hosted and held the Terraform Hashicorp User Group meetup in London in April.

Examples of some projects we have worked on:

Short lived database credentials

Our running services previously relied on having long lived credentials to access data that were rarely, if ever, rotated. We wanted human and pod identity to be used to grant short-lived credentials based on policies. We used Vault to build a solution to this problem, creating tooling such as / to make it as easy as possible for developers to use these credentials with their services. ()

: a service that integrates AWS IAM with Kubernetes

We have a lot of existing AWS resource that have their access limited using IAM. We used Kube2IAM initially but experienced race conditions that would hand different role credentials to pods. We started work on a replacement and have worked with the community to get it used in other places.

: Envoy control plane for multi-cluster load balancing

For some of our more important applications it was important to have them survive a total cluster outage. This meant we needed a way to easily route traffic to an application spread out across multiple clusters so we created Yggdrasil, a tool to configure Envoy nodes to route our traffic between clusters based on Ingress resources. ()

: more confidence in the status of your deployments

It tracks deployments as they roll out and posts useful status updates into Slack. It does this by watching the Kubernetes api for namespaces and deployments with the correct annotations. When a new deployment rollout begins and completes updates are posted to the Slack API. Any errors during the deployment rollout are captured and included in the Slack message (see example below). This can be very useful to help quickly debug a failing deployment.

You can also check out our to see a number of blogs on what we’ve been up to.

Our commitment to you

At RVU, we are dedicated to developing valuable, inclusive, and user-friendly products and services for all. To achieve this it’s essential that our teams reflect the diverse range of people in our community. We believe in being the change we wish to see in the world, by embracing our differences and holding ourselves accountable to being open and inclusive teammates and wider community members.

Benefits

What we’ll give back to you:

We want to give you a great work environment; contribute back to both your personal and professional development; and give you great benefits to make your time at RVU even more enjoyable. Some of these benefits include:

Employer matching pension up to
Hybrid approach of in-office and remote working, and a “Work from Home” budget to help contribute towards a great work environment at home
Excellent maternity, paternity and adoption leave policy, for those key moments in your life
25 days holiday (increasing to 30 days) + 2 days “My Time” per year
Up to 30 days per year “working from anywhere”
A healthy learning and training budget, as well as the chance to go to conferences around the world every year
Electric vehicles scheme
In office gym
Free breakfast in the office daily
Health insurance
Access to the Calm and Peppy app for physical and mental health
Regular events - from team socials to company-wide events with insightful external speakers, we want to make sure our colleagues continue to feel connected

Get the latest insights and jobs direct. Sign up for our newsletter.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

May 17, 2025

Jobs

Rural-Remote Machine Learning Jobs: Finding Balance Beyond the Big Cities

Over the past decade, machine learning (ML) has transformed from a niche research domain into a pervasive technology underpinning everything from recommendation systems and voice assistants to financial forecasting and autonomous vehicles. Historically, the UK’s major tech hubs—particularly London—have been magnets for top ML talent and corporate headquarters. However, remote work has become mainstream, and many ML professionals are realising they can excel in their field while living far beyond the city limits. At MachineLearningJobs.co.uk, we’ve observed a growing interest in positions that allow for a rural lifestyle or a coastal environment, often reflected in search terms like “ML remote countryside” or “tech jobs by the sea.” This surge is no coincidence. Flexible work policies, better rural broadband, and the nature of machine learning tasks—much of which can be done through cloud platforms—are bringing new opportunities to those who wish to swap urban hustle for fresh air and scenic views. Whether you’re a data scientist, ML engineer, researcher, or product manager, a rural or seaside move could reinvigorate your work-life balance. In this article, we’ll unpack why rural-remote ML jobs are on the rise, how you can navigate the challenges of leaving the city, and what you need to do to thrive in a machine learning career beyond the M25. If you’ve dreamt of looking up from your laptop to rolling fields or ocean waves, keep reading—your rural ML role might be closer than you think.

May 15, 2025

Jobs

Quantum-Enhanced Machine Learning—Propelling AI into the Next Frontier

Machine learning (ML) has revolutionised how we interpret data, build predictive models, and create intelligent applications. From recommendation engines and self-driving cars to advanced genomics and natural language processing, ML solutions are integral to nearly every corner of modern life. However, as data complexity and model size continue to skyrocket, the computational demands placed on ML systems grow in tandem—often pushing even high-performance classical computers to their limits. In recent years, quantum computing has emerged as a tantalising solution to these challenges. Unlike traditional digital systems, quantum computers exploit quantum mechanics—superposition and entanglement—to process information in ways that defy conventional logic. As these machines mature, they promise exponential speed-ups for certain tasks, potentially reshaping how we approach AI and data-intensive challenges. What does this mean for machine learning? Enter quantum-enhanced ML, a new frontier where quantum processors and classical ML frameworks unite to accelerate model training, tackle high-dimensional data, and solve complex optimisation tasks more efficiently. In this article, we will: Unpack the current state of machine learning, highlighting key bottlenecks. Provide a concise overview of quantum computing—why it’s radical and how it differs from classical technology. Examine potential breakthroughs in quantum-enhanced ML, including real-world use cases and technical approaches. Explore the roles and skill sets that will define this quantum-AI era, with guidance on how to prepare. Discuss the roadblocks (like hardware maturity and ethical concerns) and how they might be addressed in the years to come. If you’re a machine learning engineer, data scientist, or simply an AI enthusiast fascinated by the next wave of computational innovation, read on—quantum computing could become an integral part of your future toolkit, opening up job opportunities and reimagining what ML can achieve.

May 11, 2025

Jobs

Machine Learning Jobs at Newly Funded UK Start-ups: Q3 2025 Investment Tracker

Machine learning (ML) has become the beating heart of modern tech innovation, powering breakthroughs in healthcare, finance, cybersecurity, robotics, and more. Across the United Kingdom, this surge in ML-driven solutions is fueling the success of countless start-ups—and spurring demand for talented machine learning engineers, data scientists, and related professionals. If you’re eager to join a high-growth ML company or simply want to keep tabs on the latest trends, this Q3 2025 Investment Tracker will guide you through the newly funded UK start-ups pushing the boundaries of ML. In this article, we’ll highlight key developments from Q3 2025, delve into the most promising newly funded ventures, and shed light on the machine learning roles they’re urgently seeking to fill. Plus, we’ll show you how to connect with these employers via MachineLearningJobs.co.uk, a dedicated platform for ML job seekers. Let’s dive in!

Senior Platform Engineer (Infrastructure)

Related Jobs

Senior Data Engineer [UAE Based] (London Area)

Senior Data Engineer [UAE Based]

Senior Data Scientist

Senior Data Scientist (MLOps)

Senior Data Science Consultant – Econometrics specialist

User Experience Researcher

Get the latest insights and jobs direct. Sign up for our newsletter.

Industry Insights

Rural-Remote Machine Learning Jobs: Finding Balance Beyond the Big Cities

Quantum-Enhanced Machine Learning—Propelling AI into the Next Frontier

Machine Learning Jobs at Newly Funded UK Start-ups: Q3 2025 Investment Tracker

Find the perfect job? Subscribe to job alerts to stay informed about new opportunities.