Be at the heart of actionFly remote-controlled drones into enemy territory to gather vital information.

Apply Now

Reinforcement Learning Scientist

Stealth AI Startup
Sheffield
10 months ago
Applications closed

Related Jobs

View all jobs

Principal Data Scientist

Principal Data Scientist - Healthcare

Lead Data Scientist - Recommender Systems

Machine Learning Research Engineer (Foundational Research)

Principal Data Scientist - Healthcare

Principal Data Scientist - Healthcare

Join Us: Research Scientist - Online Reinforcement Learning (RL) at an Agentic AI Start-Up!


Are you ready to revolutionize the future of intelligent agents? We're anAgentic AI start-upon a mission to build the next generation of autonomous systems capable of real-time learning, adaptation, and decision-making. If you’re passionate aboutOnline Reinforcement Learningand want to shape the frontier of AI, we’d love to hear from you!


About Us


We are a well-funded, ambitious, fast-growing start-up buildingAI agentsthat can learn, adapt, and thrive in dynamic, interactive environments. Our vision is to empower businesses and individuals with cutting-edge, agentic AI solutions that redefine how machines interact with the world.


The Role


As aResearch Scientist in Online Reinforcement Learning, you will:

  • Innovate: Develop groundbreaking algorithms for real-time learning and decision-making in dynamic, multi-agent systems.
  • Collaborate: Work closely with a team of researchers and engineers to create scalable solutions that deliver real-world impact.
  • Experiment: Lead experimental projects to address challenges like stability, data efficiency, and exploration in online RL.
  • Productize AI: Translate research insights into deployable AI systems for robotics, gaming, autonomous platforms, and more.
  • Share Knowledge: Publish research at top-tier conferences (e.g., NeurIPS, ICML, ICLR) and contribute to the global AI community.


What You’ll Bring


  • PhD or equivalentin Machine Learning, Reinforcement Learning, Computer Science, or related fields.
  • Expertisein RL algorithms (e.g., PPO, A3C, DQN) and their application to dynamic environments.
  • Proven Research Impact: Strong publication record in top conferences/journals and a passion for advancing AI.
  • Technical Skills: Proficiency in Python, RL frameworks (PyTorch/TensorFlow), and cloud-based ML tools.
  • Start-Up Mindset: A proactive, problem-solving attitude and a love for tackling challenges in fast-paced environments.
  • Visionary Thinking: A deep interest in agentic AI and its potential to transform industries.


Why Join Us?


  • Impactful Work: Shape the future of agentic AI in industries like autonomous vehicles, robotics, and intelligent systems.
  • Ownership: Be part of a start-up where your ideas and contributions directly drive our success.
  • Cutting-Edge Tech: Access to the latest tools, resources, and computational infrastructure.
  • Growth Opportunities: Thrive in a collaborative, growth-focused culture that values curiosity and innovation.
  • Start-Up Perks: Competitive salary, meaningful equity, flexible work options, and a chance to grow with us.


Our Mission


At our core, we’re driven by the belief that intelligent agents can reshape the way we live, work, and explore. Join us on our journey to build a future where AI systems are not just tools but partners in discovery and creation.

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Neurodiversity in Machine Learning Careers: Turning Different Thinking into a Superpower

Machine learning is about more than just models & metrics. It’s about spotting patterns others miss, asking better questions, challenging assumptions & building systems that work reliably in the real world. That makes it a natural home for many neurodivergent people. If you live with ADHD, autism or dyslexia, you may have been told your brain is “too distracted”, “too literal” or “too disorganised” for a technical career. In reality, many of the traits that can make school or traditional offices hard are exactly the traits that make for excellent ML engineers, applied scientists & MLOps specialists. This guide is written for neurodivergent ML job seekers in the UK. We’ll explore: What neurodiversity means in a machine learning context How ADHD, autism & dyslexia strengths map to ML roles Practical workplace adjustments you can ask for under UK law How to talk about neurodivergence in applications & interviews By the end, you’ll have a clearer sense of where you might thrive in ML – & how to turn “different thinking” into a genuine career advantage.

Machine Learning Hiring Trends 2026: What to Watch Out For (For Job Seekers & Recruiters)

As we move into 2026, the machine learning jobs market in the UK is going through another big shift. Foundation models and generative AI are everywhere, companies are under pressure to show real ROI from AI, and cloud costs are being scrutinised like never before. Some organisations are slowing hiring or merging teams. Others are doubling down on machine learning, MLOps and AI platform engineering to stay competitive. The end result? Fewer fluffy “AI” roles, more focused machine learning roles with clear ownership and expectations. Whether you are a machine learning job seeker planning your next move, or a recruiter trying to build ML teams, understanding the key machine learning hiring trends for 2026 will help you stay ahead.

Machine Learning Recruitment Trends 2025 (UK): What Job Seekers Need To Know About Today’s Hiring Process

Summary: UK machine learning hiring has shifted from title‑led CV screens to capability‑driven assessments that emphasise shipped ML/LLM features, robust evaluation, observability, safety/governance, cost control and measurable business impact. This guide explains what’s changed, what to expect in interviews & how to prepare—especially for ML engineers, applied scientists, LLM application engineers, ML platform/MLOps engineers and AI product managers. Who this is for: ML engineers, applied ML/LLM engineers, LLM/retrieval engineers, ML platform/MLOps/SRE, data scientists transitioning to production ML, AI product managers & tech‑lead candidates targeting roles in the UK.