Founding Lead Machine Learning Engineer

Bjak
City of London
3 weeks ago
Create job alert
Transform language models into real-world, high-impact product experiences.

A1 is a self-funded AI group, operating in full stealth. We’re building a new global consumer AI application focused on an important but underexplored use case.


You will shape the core technical direction of A1 - model selection, training strategy, infrastructure, and long-term architecture. This is a founding technical role: your decisions will define our model stack, our data strategy, and our product capabilities for years ahead.


You won’t just fine-tune models - you’ll design systems: training pipelines, evaluation frameworks, inference stacks, and scalable deployment architectures. You will have full autonomy to experiment with frontier models (LLaMA, Mistral, Qwen, Claude-compatible architectures) and build new approaches where existing ones fall short.


Why This Role Matters

  • You are creating the intelligence layer of A1’s first product, defining how it understands, reasons, and interacts with users.


  • Your decisions shape our entire technical foundation — model architectures, training pipelines, inference systems, and long-term scalability.


  • You will push beyond typical chatbot use cases, working on a problem space that requires original thinking, experimentation, and contrarian insight.


  • You influence not just how the product works, but what it becomes, helping steer the direction of our earliest use cases.


  • You are joining as a founding builder, setting engineering standards, contributing to culture, and helping create one of the most meaningful AI applications of this wave.



What You’ll Do

  • Build end-to-end training pipelines: data → training → eval → inference


  • Design new model architectures or adapt open-source frontier models


  • Fine-tune models using state-of-the-art methods (LoRA/QLoRA, SFT, DPO, distillation)


  • Architect scalable inference systems using vLLM / TensorRT-LLM / DeepSpeed


  • Build data systems for high-quality synthetic and real-world training data


  • Develop alignment, safety, and guardrail strategies


  • Design evaluation frameworks across performance, robustness, safety, and bias


  • Own deployment: GPU optimization, latency reduction, scaling policies


  • Shape early product direction, experiment with new use cases, and build AI-powered experiences from zero


  • Explore frontier techniques: retrieval-augmented training, mixture-of-experts, distillation, multi-agent orchestration, multimodal models



What It’s Like to Work Here

  • You take ownership - you solve problems end-to-end rather than wait for perfect instructions


  • You learn through action - prototype → test → iterate → ship


  • You’re calm in ambiguity - zero-to-one building energises you


  • You bias toward speed with discipline - V1 now > perfect later


  • You see failures and feedback as essential to growth


  • You work with humility, curiosity, and a founder’s mindset


  • You lift the bar for yourself and your teammates every day



Requirements

  • Strong background in deep learning and transformer architectures


  • Hands-on experience training or fine-tuning large models (LLMs or vision models)


  • Proficiency with PyTorch, JAX, or TensorFlow


  • Experience with distributed training frameworks (DeepSpeed, FSDP, Megatron, ZeRO, Ray)


  • Strong software engineering skills — writing robust, production-grade systems


  • Experience with GPU optimization: memory efficiency, quantization, mixed precision


  • Comfortable owning ambiguous, zero-to-one technical problems end-to-end



Nice to Have

  • Experience with LLM inference frameworks (vLLM, TensorRT-LLM, FasterTransformer)


  • Contributions to open-source ML libraries


  • Background in scientific computing, compilers, or GPU kernels


  • Experience with RLHF pipelines (PPO, DPO, ORPO)


  • Experience training or deploying multimodal or diffusion models


  • Experience in large-scale data processing (Apache Arrow, Spark, Ray)


  • Prior work in a research lab (Google Brain, DeepMind, FAIR, Anthropic, OpenAI)



What You’ll Get

  • Extreme ownership and autonomy from day one - you define and build key model systems.


  • Founding-level influence over technical direction, model architecture, and product strategy.


  • Remote-first flexibility


  • High-impact scope—your work becomes core infrastructure of a global consumer AI product.


  • Competitive compensation and performance-based bonuses


  • Backing of a profitable US$2B group, with the speed of a startup


  • Insurance coverage, flexible time off, and global travel insurance


  • Opportunity to shape a new global AI product from zero


  • A small, senior, high-performance team where you collaborate directly with founders and influence every major decision.



Our Team & Culture

We operate as a dense, senior, high-performance team. We value clarity, speed, craftsmanship, and relentless ownership. We behave like founders — we build, ship, iterate, and hold ourselves to a high technical bar.


If you value excellence, enjoy building real systems, and want to be part of a small team creating something globally impactful, you’ll thrive here.


About A1

A1 is a self-funded, independent AI group backed by BJAK, focused on building a new consumer AI product with global impact. We’re assembling a small, elite team of ML and engineering builders who want to work on meaningful, high-impact problems.


#J-18808-Ljbffr

Related Jobs

View all jobs

Lecturer / Senior Lecturer in Artificial Intelligence (Machine Learning, NLP, Reinforcement Lea[...]

Reader in Artificial Intelligence (Machine Learning, NLP, Reinforcement Learning, and AI Security)

Lead AI & ML Researcher: NLP, RL & Security

Lecturer / Senior Lecturer in Artificial Intelligence (Machine Learning, NLP, Reinforcement Lea[...]

Assistant Professor in Statistical Data Science

Master Data Analyst

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

The Skills Gap in Machine Learning Jobs: What Universities Aren’t Teaching

Machine learning has moved from academic research into the core of modern business. From recommendation engines and fraud detection to medical imaging, autonomous systems and language models, machine learning now underpins many of the UK’s most critical technologies. Universities have responded quickly. Machine learning modules are now standard in computer science degrees, specialist MSc programmes have proliferated, and online courses promise to fast-track careers in the field. And yet, despite this growth in education, UK employers consistently report the same problem: Many candidates with machine learning qualifications are not job-ready. Roles remain open for months. Interview processes filter out large numbers of applicants. Graduates with strong theoretical knowledge struggle when faced with practical tasks. The issue is not intelligence or effort. It is a persistent skills gap between university-level machine learning education and real-world machine learning jobs. This article explores that gap in depth: what universities teach well, what they routinely miss, why the gap exists, what employers actually want, and how jobseekers can bridge the divide to build successful careers in machine learning.

Machine Learning Jobs for Career Switchers in Their 30s, 40s & 50s (UK Reality Check)

Are you considering a career change into machine learning in your 30s, 40s or 50s? You’re not alone. In the UK, organisations across industries such as finance, healthcare, retail, government & technology are investing in machine learning to improve decisions, automate processes & unlock new insights. But with all the hype, it can be hard to tell which roles are real job opportunities and which are just buzzwords. This article gives you a practical, UK-focused reality check: which machine learning roles truly exist, what skills employers really hire for, how long retraining realistically takes, how to position your experience and whether age matters in your favour or not. Whether you come from analytics, engineering, operations, research, compliance or business strategy, there is a credible route into machine learning if you approach it strategically.

How to Write a Machine Learning Job Ad That Attracts the Right People

Machine learning now sits at the heart of many UK organisations, powering everything from recommendation engines and fraud detection to forecasting, automation and decision support. As adoption grows, so does demand for skilled machine learning professionals. Yet many employers struggle to attract the right candidates. Machine learning job adverts often generate high volumes of applications, but few applicants have the blend of modelling skill, engineering awareness and real-world experience the role actually requires. Meanwhile, strong machine learning engineers and scientists quietly avoid adverts that feel vague, inflated or confused. In most cases, the issue is not the talent market — it is the job advert itself. Machine learning professionals are analytical, technically rigorous and highly selective. A poorly written job ad signals unclear expectations and low ML maturity. A well-written one signals credibility, focus and a serious approach to applied machine learning. This guide explains how to write a machine learning job ad that attracts the right people, improves applicant quality and strengthens your employer brand.