Artificial Intelligence Engineer - Distributed Inference

Danucore
Birmingham
3 months ago
Applications closed

Related Jobs

View all jobs

Director of Artificial Intelligence - Manufacturing & Industrial

Vision Systems Engineer

Automation Engineer

Senior Machine Learning Engineer

RF Design Engineer - LNAs, Design from LF to X Band

Digital Design Engineer - High Speed Digital Design

AI Engineer - Distributed Inference Specialist


Do you want to be a spectator or a player as the world races to develop AGI?

Are you ready to be a pioneer of AI?


Why join us?


AtDanucore, we are on the hunt forBRILLIANT MINDSto join a team of visionaries and innovators dedicated to building distributedsupercomputersandAI systemswhich are:


Faster️ –from building and deploying AI datacentres at speed to optimising the AI workloads that run on them we want to be the fastest


CheaperAI should be accessible to all. We lower the costs of AI deployment with careful hardware deployments and software systems to ensure efficient resource utilisation.


KinderOur systems are designed to benefit humanity. We do not allow our systems to participate in military, gambling or pornography applications


GreenerWe optimise energy consumption with an integrated hardware and software solution to leverage renewable energy, optimise heat recovery - all running under energy aware orchestration systems to optimise workloads


ClevererWe develop agentic AI systems and to make our systems intelligent and constantly improving


Help us build systems to ensure the power of frontier AI remainsaccessibleand give userssovereigntyover their AI systems


Join us in ensuring that the most transformative technology in human history remains in the hands of humanity itself. Let's make AI development transparent, accessible, and aligned with the interests of humanity, not just the profits of a few. ⚡


About the Role


This role is for those obsessed with pushing the boundaries of AI model performance.


We're looking for someone who gets excited about shaving milliseconds off inference time, every percentage point of GPU utilization gained and how many Watts were consumed to achieve it. ⚡️


You'll work directly with cutting-edge models — from LLMs to multimodal systems — and large GPU clusters, finding innovative ways to make them run faster, more efficiently, and more accessibly on diverse hardware setups. ️


What We're Looking For


In team members:


  • Passion for AI: A strong desire to influence the future of technology and its societal impact.
  • Willingness to Learn: we're looking for future experts with curious minds and a growth mindset.
  • Open-Mindedness: Ready to challenge the norm and think outside the box?


and for the role:


  • Evidence of deploying and optimising AI models in multi gpu and multi node systems ️ ️
  • Good working knowledge of leading AI runtimes: PyTorch, vLLM, TensorRT, ONNX Runtime, Llama.cpp ‍♂️‍➡️⏱️o
  • Experience with distributed inference engines: Ray Serve, Triton Inference Server, vLLM, SLURM
  • Knowledge of AI compilers: OpenXLA, torch.compile, OpenAI's triton, MLIR, Mojo, TVM, MLC-LLM ⚙️
  • Good working knowledge of inter-process communication: message queues, MPI, NCCL, gRPC
  • Good working knowledge of high performance networking: RDMA, RoCE, Infiniband, NVIDIA GPUDirect, NVLink, NVIDIA DOCA, MagnumIO, dpdk, spdk
  • Experience with model quantisation, pruning, and sparsity techniques for performance optimisation.


And bonus points if you have:

  • a homelab, blog, or a collection of git repos showcasing your talents and interests ‍ ‍
  • made contributions to open-source projects or publications in the field of AI/ML systems optimisation


Let us know which of the above you have worked with / are relevant in your cover letter! ✨


Key Responsibilities


  • Design and implement high-performance distributed inference systems for running large language models and multimodal AI models at scale
  • Optimise model serving infrastructure for maximum throughput, minimal latency, and optimal power efficiency ⚡
  • Develop and maintain deployment pipelines for efficient model serving, and monitoring in production
  • Research and implement cutting-edge techniques in model optimisation, including pruning, quantisation, and sparsity methods ‍
  • Design, build and configure experimental hardware setups for model serving and optimisation ️
  • Design and implement robust testing frameworks to ensure reliable model serving ✅
  • Collaborate with the team to build and improve our distributed inference platform, making it more accessible and efficient for users
  • Monitor, optimise and document system performance metrics, including latency, throughput, power consumption and benchmark scores



How Can We Tempt You?


Exceptional Financial Package: Enjoy a competitive compensation structure, including an enticing EMI scheme that rewards your brilliance.


Envious Compute Power: Gain access to a vast array of cutting-edge computing resources to bring your ideas to life!


Support for Your Vision: We believe that the brightest minds often have their own innovative projects. Let's collaborate! Share your ideas, and work with our team and support network to make them happen!


Make an Impact: Join a passionate team dedicated to creating positive change in the world. The future is ours to shape, and together we can ensure it's for the better.


Dynamic Start-Up Culture: Dive in from day one! Experience the thrill of a start-up environment where you can roll up your sleeves and make a real difference right away.



How to Apply

Email your cover letter and CV to with subject "AI Engineer - Distributed Inference"

In your cover letter, please include details of:

  • what parts or technologies mentioned in this job advert you have experience with and can add value with
  • links to any public work e.g. github profile, blogs or papers

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Machine Learning Jobs at Newly Funded UK Start-ups: Q3 2025 Investment Tracker

Machine learning (ML) has become the beating heart of modern tech innovation, powering breakthroughs in healthcare, finance, cybersecurity, robotics, and more. Across the United Kingdom, this surge in ML-driven solutions is fueling the success of countless start-ups—and spurring demand for talented machine learning engineers, data scientists, and related professionals. If you’re eager to join a high-growth ML company or simply want to keep tabs on the latest trends, this Q3 2025 Investment Tracker will guide you through the newly funded UK start-ups pushing the boundaries of ML. In this article, we’ll highlight key developments from Q3 2025, delve into the most promising newly funded ventures, and shed light on the machine learning roles they’re urgently seeking to fill. Plus, we’ll show you how to connect with these employers via MachineLearningJobs.co.uk, a dedicated platform for ML job seekers. Let’s dive in!

Portfolio Projects That Get You Hired for Machine Learning Jobs (With Real GitHub Examples)

In today’s data-driven landscape, the field of machine learning (ML) is one of the most sought-after career paths. From startups to multinational enterprises, organisations are on the lookout for professionals who can develop and deploy ML models that drive impactful decisions. Whether you’re an aspiring data scientist, a seasoned researcher, or a machine learning engineer, one element can truly make your CV shine: a compelling portfolio. While your CV and cover letter detail your educational background and professional experiences, a portfolio reveals your practical know-how. The code you share, the projects you build, and your problem-solving process all help prospective employers ascertain if you’re the right fit for their team. But what kinds of portfolio projects stand out, and how can you showcase them effectively? This article provides the answers. We’ll look at: Why a machine learning portfolio is critical for impressing recruiters. How to select appropriate ML projects for your target roles. Inspirational GitHub examples that exemplify strong project structure and presentation. Tangible project ideas you can start immediately, from predictive modelling to computer vision. Best practices for showcasing your work on GitHub, personal websites, and beyond. Finally, we’ll share how you can leverage these projects to unlock opportunities—plus a handy link to upload your CV on Machine Learning Jobs when you’re ready to apply. Get ready to build a portfolio that underscores your skill set and positions you for the ML role you’ve been dreaming of!

Machine Learning Job Interview Warm‑Up: 30 Real Coding & System‑Design Questions

Machine learning is fuelling innovation across every industry, from healthcare to retail to financial services. As organisations look to harness large datasets and predictive algorithms to gain competitive advantages, the demand for skilled ML professionals continues to soar. Whether you’re aiming for a machine learning engineer role or a research scientist position, strong interview performance can open doors to dynamic projects and fulfilling careers. However, machine learning interviews differ from standard software engineering ones. Beyond coding proficiency, you’ll be tested on algorithms, mathematics, data manipulation, and applied problem-solving skills. Employers also expect you to discuss how to deploy models in production and maintain them effectively—touching on MLOps or advanced system design for scaling model inferences. In this guide, we’ve compiled 30 real coding & system‑design questions you might face in a machine learning job interview. From linear regression to distributed training strategies, these questions aim to test your depth of knowledge and practical know‑how. And if you’re ready to find your next ML opportunity in the UK, head to www.machinelearningjobs.co.uk—a prime location for the latest machine learning vacancies. Let’s dive in and gear up for success in your forthcoming interviews.