招聘
We are looking for a passionate, talented, and inventive Data Scientist with a strong machine learning and analytics background to help build industry-leading language technology powering Rufus, our AI-driven search and shopping assistant, helping customers with their shopping tasks at every step of their shopping journey.
This innovative role focuses on developing and optimizing large language model (LLM)-powered conversational experiences. The core emphasis is to get the best performance out of state-of-the-art LLMs via careful and methodical instruction design, contextual grounding, informed choices of MCP tools and agent/multi-agent systems, evaluation frameworks, and experimentation to systematically improve LLM quality, robustness, and customer impact. The work combines scientific rigor with product intuition to systematically raise the bar for conversational AI performance at Amazon scale.
Our mission in conversational shopping is to make it easy for customers to find and discover the best products to meet their needs by helping with their product research, providing comparisons and recommendations, answering product questions, enabling shopping directly from images or videos, providing visual inspiration, and more. We do this by leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data-driven insights to continuously improve our systems.
Key job responsibilities
As a Data Scientist on our team, you will develop and maintain LLM instructions iterations and evaluation frameworks, including automated eval pipelines, LLM-as-a-judge methodologies, rubric design, and dataset curation to measure nuanced aspects of response quality. You will partner with the wider org to experiment with techniques such as retrieval augmentation, context enrichment, prompt decomposition, and model fine-tuning or post-training strategies, if and when applicable. You will leverage petabytes of data and identify opportunities to leverage machine learning models aimed at making conversational systems more performant.
A day in the life
You will:
Perform hands-on analysis of large-scale multimodal interaction datasets to develop insights into how customers engage with conversational AI systems and how to improve response quality and customer experience. Use statistical methods, experimentation, and data-driven analysis to develop scalable approaches for measuring, evaluating, and optimizing large language model (LLM)-based shopping assistant systems, leveraging structured and unstructured contextual signals. Design and analyze A/B tests and experiments to evaluate new features and model improvements, ensuring statistical rigor and actionable insights. Develop metrics, dashboards, and reporting frameworks to monitor system performance, customer engagement, and business impact. Conduct deep-dive analyses to identify opportunities for improving conversational relevance, grounding, customer satisfaction, and downstream business impact. Collaborate with Applied Scientists and Engineers to translate analytical insights into production systems, working closely on model evaluation and deployment. Establish automated processes for large-scale data analysis, ETL pipelines, metric generation, and experimentation frameworks. Communicate results and insights to both technical and non-technical audiences, including through presentations, written reports, and data visualizations.
About the team
The Rufus Features Science team, based in London, works alongside ~150 engineers, designers and product managers, shaping the future of AI-driven shopping experiences at Amazon. The team works on every aspect of the Rufus AI, from making Rufus agentic, enabling customers to set price alerts or empower Rufus to act on their behalf and automatically purchase products when the price is right, to understanding multimodal user queries and generating answers that combine text, image, audio and video, including deep research reports that scour the web and the Amazon catalog to provide detailed and personalised shopping guidance. We utilize and advance state-of-art techniques in the fields of Natural Language Processing, gen AI, Information Retrieval, Machine/Deep Learning, and Data Mining. We validate our work by actively participating in the internal and external scientific communities.
Basic Qualifications
- Experience with machine learning/statistical modeling data analysis tools and techniques, and parameters that affect their performance
- Experience in a ML or data scientist role with a large technology company
- Experience with data scripting languages (e.g. SQL, Python, R etc.) or statistical/mathematical software (e.g. R, SAS, or Matlab)
- Experience effectively communicating complex concepts through written and verbal communication
- Master's degree or above in Math, Statistics, Computer Science, or related science field
Preferred Qualifications
- Experience with AWS services including S3, Redshift, Sagemaker, EMR, Kinesis, Lambda, and EC2
- Experience in defining and creating benchmarks for assessing GenAI model performance
- Experience working on multi-team, cross-disciplinary projects
Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice (https://www.amazon.jobs/en/privacy_page) to know more about how we collect, use and transfer the personal data of our candidates.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Research Engineer, Education
Google DeepMind · London, UK

Research Scientist, Science of Post-Training and Reinforcement Learning
Google DeepMind · London, UK

Senior Data Scientist
JPMorgan Chase · London, United Kingdom
DE
Senior Data Scientist
DeepL · London

Lead Data Scientist - Fraud Prevention
Wise · London
About Amazon

Amazon
PublicAmazon.com, Inc. is an American multinational technology company engaged in e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence.
10,001+
Employees
Seattle
Headquarters
Reviews
2.9
10 reviews
Work Life Balance
2.8
Compensation
3.7
Culture
2.5
Career
2.3
Management
2.1
35%
Recommend to a Friend
Pros
Good pay and compensation
Strong benefits package
Flexible scheduling options
Cons
Poor management and leadership
Limited growth and promotion opportunities
High stress and demanding work environment
Salary Ranges
2 data points
Junior/L3
L2
L3
L4
L5
L6
M3
M4
M5
M6
Mid/L4
Principal/L7
Senior/L5
Staff/L6
Director
Junior/L3 · Data Scientist L4
0 reports
$181,968
total / year
Base
-
Stock
-
Bonus
-
$154,672
$209,264
Interview Experience
10 interviews
Difficulty
3.7
/ 5
Duration
21-35 weeks
Offer Rate
20%
Experience
Positive 10%
Neutral 10%
Negative 80%
Interview Process
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Phone Screen
5
Onsite/Virtual Loop
6
Team Matching
7
Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Leadership Principles
Technical Knowledge
News & Buzz
Life on Fire Announces Amazon Bestseller Milestone for “The Wisdom Collective” - Yahoo Finance
Source: Yahoo Finance
News
·
5w ago
Amazon shuts down controversial payment method - AL.com
Source: AL.com
News
·
5w ago
Amazon Prime members can score these bestselling wireless earbuds for only $20 - thestreet.com
Source: thestreet.com
News
·
5w ago
After lawsuit one of the biggest Amazon customers Perplexity signs $750 million deal with Microsoft, says - Times of India
Source: Times of India
News
·
5w ago