Research Engineer/Research Scientist - Red Team (Misuse)

AI Safety Institute
London, United Kingdom
2 weeks ago
Posted
1 Apr 2026 (2 weeks ago)

About the AI Security Institute

The AI Security Institute is the world's largest and best-funded team dedicated to understanding advanced AI risks and translating that knowledge into action. We’re in the heart of the UK government with direct lines to No. 10 (the Prime Minister's office), and we work with frontier developers and governments globally.

We’re here because governments are critical for advanced AI going well, and UK AISI is uniquely positioned to mobilise them. With our resources, unique agility and international influence, this is the best place to shape both AI development and government action.

The deadline for applying to this role is Tuesday 7th April 2026, end of day, anywhere on Earth.

Team Description

Interventions that secure a system from abuse by bad actors or misaligned AI systems will grow in importance as AI systems become more capable, autonomous, and integrated into society.

The Misuse Red Team is a specialised subteam within AISI's wider Red Team. We red-team frontier AI safeguards for dangerous capabilities, research novel attack vectors, and develop advanced automated attack tooling. We share our findings with frontier AI companies (including Anthropic, OpenAI, DeepMind), key UK officials, and other governments to inform their respective deployment, research, and policy decision-making.

We have published on several topics, including novel automated attack algorithms (Boundary Point Jailbreaking), poisoning attacks, safeguards safety cases, defending finetuning APIs, third-party attacks on agents, agent misuse, and pre-training data filtering. Some example impact cases have been advancing the benchmarking of agent misuse, identifying novel vulnerabilities and collaborating with frontier labs to mitigate them, and producing insights into the feasibility and effectiveness of attacks and defences in data poisoning and fine-tuning APIs.

Role Description

We’re looking for research scientists and research engineers for our misuse sub-team with expertise developing and analysing attacks and protections for systems based on large language models or who have broader experience with frontier LLM research and development. An ideal candidate would have a strong track record of performing and publishing novel and impactful research in these or other areas of LLM research. We’re looking for:

  • Research Scientists, who typically leadtechnical direction– picking the questions, designing the experiments, and owning the conclusions (typically evidenced by a strong publication record).
  • Research Engineers, who typically leadexecution – building the systems and code that make those experiments possible at scale, and owning reliability, speed, and reproducibility.

In practice, we can support staff’s work spanning or alternating between research and engineering.If you have a preference, please specify this in your application.

The team is currently led by Eric Winsor and Xander Davies – advised by Geoffrey Irving and Yarin Gal. You’ll work with incredible technical staff across AISI, including alumni from Anthropic, OpenAI, DeepMind, and top universities. You may also collaborate with external teams from Anthropic, OpenAI, and Gray Swan.

We are open to hires at junior, senior, staff and principal research scientist levels.

Representative projects you might work on

  • Designing, building, running and evaluating methods to automatically attack and evaluate safeguards, such as LLM-automated attacking and direct optimisation approaches.
  • Building a benchmark for asynchronous monitoring for signs of misuse and jailbreak development across multiple model interactions.
  • Investigating novel attacks and defences for data poisoning LLMs with backdoors or other attacker goals.
  • Performing adversarial testing of frontier AI system safeguards and producing reports that are impactful and action-guiding for safeguard developers.

What we’re looking for

In accordance with the Civil Service Commission rules, the following list contains all selection criteria for the interview process.

The experiences listed below should be interpreted as examples of the expertise we're looking for, as opposed to a list of everything we expect to find in one applicant:

You may be a good fit if you have:

  • Hands-on research experience with large language models (LLMs) - such as training, fine-tuning, evaluation, or safety research.
  • A demonstrated track record of peer-reviewed publications in top-tier ML conferences or journals.
  • Ability and experience writing clean, documented research code for machine learning experiments, including experience with ML frameworks like PyTorch or evaluation frameworks like Inspect.
  • A sense of mission, urgency, responsibility for success.
  • An ability to bring your own research ideas and work in a self-directed way, while also collaborating effectively and prioritizing team efforts over extensive solo work.

Strong candidates may also have:

  • Experience working on adversarial robustness, other areas of AI security, or red teaming against any kind of system.
  • Experience working on AI alignment or AI control.

    Related Jobs

    View all jobs

    (Alignment) Research Engineer/Research Scientist - Red Team

    AI Safety Institute London, United Kingdom

    Research Engineer - Societal Impacts

    AI Safety Institute London, United Kingdom

    ML Research Engineer, London

    Isomorphic Labs London, United Kingdom

    Research Engineer

    Relation Therapeutics London, United Kingdom
    Permanent

    Principal Machine Learning Infrastructure Engineer

    PhysicsX London, United Kingdom

    Forward Deployed AI Engineer

    Latent Labs London, United Kingdom, United Kingdom
    Hybrid

    Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

    By subscribing, you agree to our privacy policy and terms of service.

    Industry Insights

    Discover insightful articles, industry insights, expert tips, and curated resources.

    Where to Advertise Machine Learning Jobs in the UK (2026 Guide)

    Advertising machine learning jobs in the UK requires a different approach to most technical hiring. The candidate pool is small, highly specialised and in demand across AI labs, financial services, healthcare, autonomous systems and consumer technology simultaneously. Machine learning engineers and researchers move between roles through professional networks, conference communities and specialist platforms — not general job boards where ML roles compete with unrelated software engineering positions for the same audience. This guide, published by MachineLearningJobs.co.uk, covers where to advertise machine learning roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about hiring across different role types.

    New Machine Learning Employers to Watch in 2026: UK and Global Companies Driving ML Innovation

    Machine learning (ML) has transitioned from a specialised field into a core business capability. In 2026, organisations across healthcare, finance, robotics, autonomous systems, natural language processing, and analytics are expanding their machine learning teams to build scalable intelligent products and services. For professionals exploring opportunities on www.MachineLearningJobs.co.uk , understanding the companies that are scaling, winning investment, or securing high‑impact contracts is crucial. This article highlights the new and high‑growth machine learning employers to watch in 2026, focusing on UK innovators, international firms with significant UK presence, and global platforms investing in machine learning talent locally.

    How Many Machine Learning Tools Do You Need to Know to Get a Machine Learning Job?

    Machine learning is one of the most exciting and rapidly growing areas of tech. But for job seekers it can also feel like a maze of tools, frameworks and platforms. One job advert wants TensorFlow and Keras. Another mentions PyTorch, scikit-learn and Spark. A third lists Mlflow, Docker, Kubernetes and more. With so many names out there, it’s easy to fall into the trap of thinking you must learn everything just to be competitive. Here’s the honest truth most machine learning hiring managers won’t say out loud: 👉 They don’t hire you because you know every tool. They hire you because you can solve real problems with the tools you know. Tools are important — no doubt — but context, judgement and outcomes matter far more. So how many machine learning tools do you actually need to know to get a job? For most job seekers, the real number is far smaller than you think — and more logically grouped. This guide breaks down exactly what employers expect, which tools are core, which are role-specific, and how to structure your learning for real career results.