Jobs

Research Technology Developer


Job details
  • Campbell North
  • London
  • 1 week ago

Company Overview:

Our client is a research-driven organisation led by passionate mathematicians and computer scientists. The Research Technology team lies at the heart of the company, managing one of the largest HPC clusters in the world. This team is critical to the firm's success, facilitating trades with daily volumes exceeding $250 billion globally.

Team Overview:

The Research Technology team is a full-stack team that collaborates closely with researchers to develop a highly performant, reliable, and transparent system. The team builds custom software to support an exa-scale filesystem, job scheduler, and zero-touch platforms for seamless integration with data centre operations. They are also responsible for developing custom file formats, compression algorithms, GPU tooling, and network management software to optimise performance.

Key Responsibilities:

  • Design and build software for the HPC cluster, focusing on performance, reliability, and scalability.
  • Mentor junior team members and push the boundaries of the team’s capabilities.
  • Engage constructively with researchers to find novel and scalable solutions.
  • Promote and implement radical changes and alternative ways of thinking while maintaining a pragmatic approach to minimise operational risks.
  • Manage and maintain a complex live system 24/7, delivering changes on short notice or tight deadlines.

What You Will Be Working On:

  • Developing an exascale filesystem handling billions of directories, a trillion files, and a million clients with complete resiliency against hardware failure.
  • Enhancing a dynamic job scheduler managing over 10 million entries and 100,000 concurrent tasks.
  • Building zero-touch platforms for monitoring, operating, and upgrading tens of thousands of machines.
  • Creating custom file formats, compression algorithms, and GPU tooling to optimise performance from 20,000 high-end GPUs.
  • Expanding the HPC cluster to provide access to more teams and multiple data centres.
  • Improving measurement and optimisation of resource usage across the entire cluster.

Essential Attributes:

  • Strong academic grounding in computer science fundamentals, including algorithms and data structures.
  • Proficiency in at least one statically typed language; experience with Golang and Rust is beneficial but not required. Scripting is primarily in Python.
  • Approximately 5-10 years of experience in designing and building large-scale distributed systems with highly scalable solutions.
  • Excellent problem-solving and analytical skills.
  • Familiarity with the Linux operating system, particularly in diagnosing performance and scalability issues.
  • Ability to multitask, manage multiple projects simultaneously, and prioritise effectively.
  • High self-motivation and the ability to work independently without supervision.
  • Understanding machine learning frameworks and compute offload devices, such as GPUs, is an advantage.

This role offers the opportunity to work in a fast-paced, research-driven environment where you can significantly impact the firm’s HPC infrastructure and overall success. We encourage you to apply if you are a self-starter passionate about developing cutting-edge technology.

Sign up for our newsletter

The latest news, articles, and resources, sent to your inbox weekly.

Similar Jobs

Software Developer - Data Pipelines (Python)

Squarepoint is a global investment management firm that utilizes a diversified portfolio of systematic and quantitative strategies across financial markets that seeks to achieve high quality, uncorrelated returns for our clients. We have deep expertise in trading, technology and operations and attribute our success to rigorous scientific research. As a...

Squarepoint Capital London

Director of Product Marketing

Quantinuum is at the forefront of developing the world's most advanced full-stack integrated quantum computing technologies. We are surpassing the boundaries of classical physics and modern supercomputing, ushering in a new era of computing that promises to revolutionize industries including medicine, materials science, finance, artificial intelligence, machine learning, cryptography, and...

Quantinuum London

Algorithm Developer

Location: Cheadle, United KingdomThales people provide armed forces customers with operational advantage at every decisive moment throughout the mission. Defence and armed forces customers rely on us to deliver the full range of defence mission systems solutions at land, sea, and air. Our platforms extend across the battlespace including Above...

Thales Cheadle

Animation Engineering Director

WHAT MAKES US EPIC?Read on to find out what you will need to succeed in this position, including skills, qualifications, and experience.At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine...

Epic Games London

C++ Developer

C++ Developer – Quantitative Trading SystemsAre you ready to push the boundaries of C++ development in a live trading environment?Our client, a quantitative investment firm, uses scientific rigor and mathematical precision to optimize trading strategies in global markets. They’re looking for a C++ Developer who has proven experience in asset...

Langbourn

Senior Applied Scientist, TinyML

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, veteran...

Wayve London