Research Scientist - Large Language Model Post-Training

PolyAI
London
1 month ago
Applications closed

Related Jobs

View all jobs

Machine Learning Research Scientist - PhD, NLP, LLM

Senior Data Scientist (Document Search)

Senior Data Scientist (Document Search)

Principal Data Scientist - NLP

Principal Data Scientist – Operational Research, Simulation & ML

Data Scientist II

Research Scientist - Large Language Model Post-Training

London, England, United Kingdom

PolyAI automates customer service through lifelike voice assistants that let customers lead a conversation. Our voice assistants make it possible for businesses to deliver outstanding customer service that rivals their human agents. Our customers, which include the world’s leading logos, are expanding how they use our platform, driving automation of critical customer service operations and integrating PolyAI into their daily customer service workflows.

As a Research Scientist specialising in large language model post-training, you will play a key role in shaping and implementing strategies for aligning language models for use in our conversational AI platform. Your primary focus will be on post-training techniques such as preference-finetuning, reward modelling, and synthetic data generation.

Responsibilities:

  • Train models and conduct experiments to assess model performance in live deployments.
  • Develop post-training strategies to achieve state of the art performance on domain-specific tasks.
  • Generate, collect, and annotate contact center data from sources such as real customer calls, chats, online open datasets, and synthetic data.
  • Develop robust evaluation benchmarks to track improvements in production models.
  • Collaborate with the legal and compliance team to address any compliance or data privacy-related issues.
  • Work closely with product and engineering teams to ensure alignment with business and production goals.
  • Stay informed about the latest advancements in machine learning, ASR, TTS, and LLM to continuously enhance our technologies.

Requirements:

  • A degree in Computer Science, Machine Learning, or a related field, or equivalent industry experience.
  • 3+ years of experience working with deep learning and statistical models.
  • Strong knowledge of data quality standards and annotation processes, with the ability to independently evaluate and improve models.
  • Proficiency in Python and familiarity with relevant ML frameworks and libraries (e.g., PyTorch).
  • Experience with cloud services such as AWS, GCP, or Azure.
  • Excellent verbal and written communication skills, with the ability to convey complex technical concepts to diverse audiences.
  • A passion for solving technical challenges and driving practical solutions.

Preferred Qualifications:

  • Experience working with LLMs and data preparation pipelines.
  • Experience with speech models, such as ASR or TTS.

Benefits:

  • Participation in the company’s employee share options plan.
  • 25 days holiday, plus bank holidays.
  • Flexible working from home policy.
  • Work from outside of the UK for up to 6 months each year.
  • Enhanced parental leave.
  • Bike2Work scheme.
  • Annual learning and development allowance.
  • One-off WFH allowance when you join.
  • Company-funded fertility and family-forming programmes.
  • Menopause care programme with Maven.
  • Private healthcare and dental cover, discounts on gym memberships and relaxation apps, and access to a range of mental health programs.

At PolyAI, we take great pride in our values—they guide everything we do. We believe that a strong culture leads to meaningful work and lasting impact.

Our core values are:

  • Only the best:We expect the best from our people, we hire people that expect the best from themselves, and we nurture this drive for excellence.
  • Ownership:We care deeply about what we do. We take ownership of our initiatives, decisions and outcomes.
  • Relentlessly improve:We demand more from ourselves and are always evolving. Continuous, obsessive improvement is the only way we will transform the world of conversational AI.
  • Bias for action:Our world moves quickly and so do we. We take calculated risks and we deliver impact fast.
  • Disagree and commit:We are all working toward the same goal. If we don’t agree with something, we work hard to understand it and when a decision is made, we accept it and give it our all.
  • Build for people:We are hyper-focused on delivering the best automated experiences possible so that we can empower people to get exactly what they need, when they need it.

PolyAI is proud to be an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All employment decisions at PolyAI will be based on the business needs without attention to ethnicity, religion, sexual orientation, gender identity, family or parental status, national origin, neurodiversity status or disability status.

#J-18808-Ljbffr

Get the latest insights and jobs direct. Sign up for our newsletter.

By subscribing you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Portfolio Projects That Get You Hired for Machine Learning Jobs (With Real GitHub Examples)

In today’s data-driven landscape, the field of machine learning (ML) is one of the most sought-after career paths. From startups to multinational enterprises, organisations are on the lookout for professionals who can develop and deploy ML models that drive impactful decisions. Whether you’re an aspiring data scientist, a seasoned researcher, or a machine learning engineer, one element can truly make your CV shine: a compelling portfolio. While your CV and cover letter detail your educational background and professional experiences, a portfolio reveals your practical know-how. The code you share, the projects you build, and your problem-solving process all help prospective employers ascertain if you’re the right fit for their team. But what kinds of portfolio projects stand out, and how can you showcase them effectively? This article provides the answers. We’ll look at: Why a machine learning portfolio is critical for impressing recruiters. How to select appropriate ML projects for your target roles. Inspirational GitHub examples that exemplify strong project structure and presentation. Tangible project ideas you can start immediately, from predictive modelling to computer vision. Best practices for showcasing your work on GitHub, personal websites, and beyond. Finally, we’ll share how you can leverage these projects to unlock opportunities—plus a handy link to upload your CV on Machine Learning Jobs when you’re ready to apply. Get ready to build a portfolio that underscores your skill set and positions you for the ML role you’ve been dreaming of!

Machine Learning Job Interview Warm‑Up: 30 Real Coding & System‑Design Questions

Machine learning is fuelling innovation across every industry, from healthcare to retail to financial services. As organisations look to harness large datasets and predictive algorithms to gain competitive advantages, the demand for skilled ML professionals continues to soar. Whether you’re aiming for a machine learning engineer role or a research scientist position, strong interview performance can open doors to dynamic projects and fulfilling careers. However, machine learning interviews differ from standard software engineering ones. Beyond coding proficiency, you’ll be tested on algorithms, mathematics, data manipulation, and applied problem-solving skills. Employers also expect you to discuss how to deploy models in production and maintain them effectively—touching on MLOps or advanced system design for scaling model inferences. In this guide, we’ve compiled 30 real coding & system‑design questions you might face in a machine learning job interview. From linear regression to distributed training strategies, these questions aim to test your depth of knowledge and practical know‑how. And if you’re ready to find your next ML opportunity in the UK, head to www.machinelearningjobs.co.uk—a prime location for the latest machine learning vacancies. Let’s dive in and gear up for success in your forthcoming interviews.

Negotiating Your Machine Learning Job Offer: Equity, Bonuses & Perks Explained

How to Secure a Compensation Package That Matches Your Technical Mastery and Strategic Influence in the UK’s ML Landscape Machine learning (ML) has rapidly shifted from an emerging discipline to a mission-critical function in modern enterprises. From optimising e-commerce recommendations to powering autonomous vehicles and driving innovation in healthcare, ML experts hold the keys to transformative outcomes. As a mid‑senior professional in this field, you’re not only crafting sophisticated algorithms; you’re often guiding strategic decisions about data pipelines, model deployment, and product direction. With such a powerful impact on business results, companies across the UK are going beyond standard salary structures to attract top ML talent. Negotiating a compensation package that truly reflects your value means looking beyond the numbers on your monthly payslip. In addition to a competitive base salary, you could be securing equity, performance-based bonuses, and perks that support your ongoing research, development, and growth. However, many mid‑senior ML professionals leave these additional benefits on the table—either because they’re unsure how to negotiate them or they simply underestimate their long-term worth. This guide explores every critical aspect of negotiating a machine learning job offer. Whether you’re joining an AI-focused start-up or a major tech player expanding its ML capabilities, understanding equity structures, bonus schemes, and strategic perks will help you lock in a package that matches your technical expertise and strategic influence. Let’s dive in.