Data Scientist II, RufusX Science UK

Amazon
London, United Kingdom
Last month
Job Type
Permanent
Posted
3 Mar 2026 (Last month)
We are looking for a passionate, talented, and inventive Data Scientist with a strong machine learning and analytics background to help build industry-leading language technology powering Rufus, our AI-driven search and shopping assistant, helping customers with their shopping tasks at every step of their shopping journey.

This innovative role focuses on developing and optimizing large language model (LLM)-powered conversational experiences. The core emphasis is to get the best performance out of state-of-the-art LLMs via careful and methodical instruction design, contextual grounding, informed choices of MCP tools and agent/multi-agent systems, evaluation frameworks, and experimentation to systematically improve LLM quality, robustness, and customer impact. The work combines scientific rigor with product intuition to systematically raise the bar for conversational AI performance at Amazon scale.

Our mission in conversational shopping is to make it easy for customers to find and discover the best products to meet their needs by helping with their product research, providing comparisons and recommendations, answering product questions, enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data-driven insights to continuously improve our systems.

Key job responsibilities
As a Data Scientist on our team, you will develop and maintain LLM instructions iterations and evaluation frameworks, including automated eval pipelines, LLM-as-a-judge methodologies, rubric design, and dataset curation to measure nuanced aspects of response quality. You will partner with the wider org to experiment with techniques such as retrieval augmentation, context enrichment, prompt decomposition, and model fine-tuning or post-training strategies, if and when applicable. You will leverage petabytes of data and identify opportunities to leverage machine learning models aimed at making conversational systems more performant.

A day in the life
You will:
Perform hands-on analysis of large-scale multimodal interaction datasets to develop insights into how customers engage with conversational AI systems and how to improve response quality and customer experience. Use statistical methods, experimentation, and data-driven analysis to develop scalable approaches for measuring, evaluating, and optimizing large language model (LLM)-based shopping assistant systems, leveraging structured and unstructured contextual signals. Design and analyze A/B tests and experiments to evaluate new features and model improvements, ensuring statistical rigor and actionable insights. Develop metrics, dashboards, and reporting frameworks to monitor system performance, customer engagement, and business impact. Conduct deep-dive analyses to identify opportunities for improving conversational relevance, grounding, customer satisfaction, and downstream business impact. Collaborate with Applied Scientists and Engineers to translate analytical insights into production systems, working closely on model evaluation and deployment. Establish automated processes for large-scale data analysis, ETL pipelines, metric generation, and experimentation frameworks. Communicate results and insights to both technical and non-technical audiences, including through presentations, written reports, and data visualizations.

About the team
The Rufus Features Science team, based in London, works alongside ~150 engineers, designers and product managers, shaping the future of AI-driven shopping experiences at Amazon. The team works on every aspect of the Rufus AI, from making Rufus agentic, enabling customers to set price alerts or empower Rufus to act on their behalf and automatically purchase products when the price is right, to understanding multimodal user queries and generating answers that combine text, image, audio and video, including deep research reports that scour the web and the Amazon catalog to provide detailed and personalised shopping guidance. We utilize and advance state-of-art techniques in the fields of Natural Language Processing, gen AI, Information Retrieval, Machine/Deep Learning, and Data Mining. We validate our work by actively participating in the internal and external scientific communities.

Related Jobs

View all jobs

Data Scientist

Faculty London, United Kingdom
Hybrid

Data Scientist, Integrity Measurement

OpenAI London, United Kingdom
Hybrid

Data Scientist – Cross Indication (12-month

Relation Therapeutics London, United Kingdom
Permanent

Data Scientist / Algorithm Engineer

PhysicsX United Kingdom

Commercial Data Scientist

Synthesia London, United Kingdom
Remote

Senior Data Scientist

Faculty London, United Kingdom
Remote

Subscribe to Future Tech Insights for the latest jobs & insights, direct to your inbox.

By subscribing, you agree to our privacy policy and terms of service.

Industry Insights

Discover insightful articles, industry insights, expert tips, and curated resources.

Where to Advertise Machine Learning Jobs in the UK (2026 Guide)

Advertising machine learning jobs in the UK requires a different approach to most technical hiring. The candidate pool is small, highly specialised and in demand across AI labs, financial services, healthcare, autonomous systems and consumer technology simultaneously. Machine learning engineers and researchers move between roles through professional networks, conference communities and specialist platforms — not general job boards where ML roles compete with unrelated software engineering positions for the same audience. This guide, published by MachineLearningJobs.co.uk, covers where to advertise machine learning roles in the UK in 2026, how the main platforms compare, what employers should expect to pay, and what the data says about hiring across different role types.

New Machine Learning Employers to Watch in 2026: UK and Global Companies Driving ML Innovation

Machine learning (ML) has transitioned from a specialised field into a core business capability. In 2026, organisations across healthcare, finance, robotics, autonomous systems, natural language processing, and analytics are expanding their machine learning teams to build scalable intelligent products and services. For professionals exploring opportunities on www.MachineLearningJobs.co.uk , understanding the companies that are scaling, winning investment, or securing high‑impact contracts is crucial. This article highlights the new and high‑growth machine learning employers to watch in 2026, focusing on UK innovators, international firms with significant UK presence, and global platforms investing in machine learning talent locally.

How Many Machine Learning Tools Do You Need to Know to Get a Machine Learning Job?

Machine learning is one of the most exciting and rapidly growing areas of tech. But for job seekers it can also feel like a maze of tools, frameworks and platforms. One job advert wants TensorFlow and Keras. Another mentions PyTorch, scikit-learn and Spark. A third lists Mlflow, Docker, Kubernetes and more. With so many names out there, it’s easy to fall into the trap of thinking you must learn everything just to be competitive. Here’s the honest truth most machine learning hiring managers won’t say out loud: 👉 They don’t hire you because you know every tool. They hire you because you can solve real problems with the tools you know. Tools are important — no doubt — but context, judgement and outcomes matter far more. So how many machine learning tools do you actually need to know to get a job? For most job seekers, the real number is far smaller than you think — and more logically grouped. This guide breaks down exactly what employers expect, which tools are core, which are role-specific, and how to structure your learning for real career results.