National AI Awards 2025Discover AI's trailblazers! Join us to celebrate innovation and nominate industry leaders.

Nominate & Attend

Research Scientist - Large Language Model Post-Training

PolyAI
London
2 months ago
Applications closed

Related Jobs

View all jobs

Research Scientist

Research Scientist | Reinforcement Learning | Python | Machine Learning | Deep Learning | Hybri[...]

AI Research Engineer, PhD

Deep Learning Researcher

Senior Data Scientist - Operational Research (Routing)

Senior Applied Scientist - Computer Vision

Research Scientist - Large Language Model Post-Training

London, England, United Kingdom

PolyAI automates customer service through lifelike voice assistants that let customers lead a conversation. Our voice assistants make it possible for businesses to deliver outstanding customer service that rivals their human agents. Our customers, which include the world’s leading logos, are expanding how they use our platform, driving automation of critical customer service operations and integrating PolyAI into their daily customer service workflows.

As a Research Scientist specialising in large language model post-training, you will play a key role in shaping and implementing strategies for aligning language models for use in our conversational AI platform. Your primary focus will be on post-training techniques such as preference-finetuning, reward modelling, and synthetic data generation.

Responsibilities:

  • Train models and conduct experiments to assess model performance in live deployments.
  • Develop post-training strategies to achieve state of the art performance on domain-specific tasks.
  • Generate, collect, and annotate contact center data from sources such as real customer calls, chats, online open datasets, and synthetic data.
  • Develop robust evaluation benchmarks to track improvements in production models.
  • Collaborate with the legal and compliance team to address any compliance or data privacy-related issues.
  • Work closely with product and engineering teams to ensure alignment with business and production goals.
  • Stay informed about the latest advancements in machine learning, ASR, TTS, and LLM to continuously enhance our technologies.

Requirements:

  • A degree in Computer Science, Machine Learning, or a related field, or equivalent industry experience.
  • 3+ years of experience working with deep learning and statistical models.
  • Strong knowledge of data quality standards and annotation processes, with the ability to independently evaluate and improve models.
  • Proficiency in Python and familiarity with relevant ML frameworks and libraries (e.g., PyTorch).
  • Experience with cloud services such as AWS, GCP, or Azure.
  • Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences.
  • A passion for solving technical challenges and driving practical solutions.

Preferred Qualifications:

  • Experience working with LLMs and data preparation pipelines.
  • Experience with speech models, such as ASR or TTS.

Benefits:

  • Participation in the company’s employee share options plan.
  • 25 days holiday, plus bank holidays.
  • Flexible working from home policy.
  • Work from outside of the UK for up to 6 months each year.
  • Enhanced parental leave.
  • Bike2Work scheme.
  • Annual learning and development allowance.
  • One-off WFH allowance when you join.
  • Company-funded fertility and family-forming programmes.
  • Menopause care programme with Maven.
  • Private healthcare and dental cover, discounts on gym memberships and relaxation apps, and access to a range of mental health programs.

At PolyAI, we take great pride in our values—they guide everything we do. We believe that a strong culture leads to meaningful work and lasting impact.

Our core values are:

  • Only the best:We expect the best from our people, we hire people that expect the best from themselves, and we nurture this drive for excellence.
  • Ownership:We care deeply about what we do. We take ownership of our initiatives, decisions and outcomes.
  • Relentlessly improve:We demand more from ourselves and are always evolving. Continuous, obsessive improvement is the only way we will transform the world of conversational AI.
  • Bias for action:Our world moves quickly and so do we. We take calculated risks and we deliver impact fast.
  • Disagree and commit:We are all working toward the same goal. If we don’t agree with something, we work hard to understand it and when a decision is made, we accept it and give it our all.
  • Build for people:We are hyper-focused on delivering the best automated experiences possible so that we can empower people to get exactly what they need, when they need it.

PolyAI is proud to be an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

#J-18808-Ljbffr

National AI Awards 2025

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

LinkedIn Profile Checklist for Machine Learning Jobs: 10 Tweaks to Drive Recruiter Interest

The machine learning landscape is rapidly evolving, with demand soaring for experts in modelling, algorithm tuning and data-driven insights. Recruiters hunt for candidates proficient in Python, TensorFlow, PyTorch and MLOps processes. A generic profile simply won’t cut it. Our step-by-step LinkedIn for machine learning jobs checklist covers 10 targeted tweaks to ensure your profile ranks in searches and communicates your technical impact. Whether launching your ML career or seeking leadership roles, these optimisations will sharpen your professional narrative and boost recruiter engagement.

Part-Time Study Routes That Lead to Machine Learning Jobs: Evening Courses, Bootcamps & Online Masters

Machine learning—a subset of artificial intelligence—enables computers to learn from data and improve over time without explicit programming. From predictive maintenance in manufacturing to recommendation engines in e-commerce and diagnostic tools in healthcare, machine learning (ML) underpins many of today’s most innovative applications. In the UK, demand for ML professionals—engineers, data scientists, research scientists and ML operations specialists—is growing rapidly, with roles projected to increase by over 50% in the next five years. However, many aspiring ML practitioners cannot step away from work or personal commitments for full-time study. Thankfully, a rich ecosystem of part-time learning pathways—Evening Courses, Intensive Bootcamps and Flexible Online Master’s Programmes—empowers you to learn machine learning while working. This comprehensive guide examines each route: foundational CPD units, immersive bootcamps, accredited online MSc programmes, funding options, planning strategies and a real-world case study. Whether you’re a software developer branching into ML, a statistician aiming to upskill, or a professional exploring AI-driven innovation, you’ll discover how to build in-demand ML expertise on your own schedule.

The Ultimate Assessment-Centre Survival Guide for Machine Learning Jobs in the UK

Assessment centres for machine learning positions in the UK are designed to reflect the complexity and collaboration required in real-world ML projects. From psychometric assessments and live model-building tasks to group data science challenges and behavioural interviews, recruiters evaluate your statistical understanding, coding skills, communication and teamwork. Whether you specialise in deep learning, reinforcement learning or NLP, this guide offers a step-by-step approach to excel at every stage and secure your next ML role.