招聘

AI Benchmarking Lead, Performance Benchmarking Evaluation
Hyderabad, TS, IND
·
On-site
·
Full-time
·
2d ago
Join our mission-critical team supporting Seller Assistant, Amazon's Gen-AI powered copilot that helps sellers navigate Amazon's complex ecosystem and grow their businesses. As a Quality Assurance Specialist, you'll play a pivotal role in ensuring the reliability and accuracy of AI model evaluations as we scale from 61% to 90%+ active seller coverage worldwide.
About Seller Assistant:
Seller Assistant is a conversational AI copilot that understands the full context of a seller's business. It intelligently orchestrates backend tools to deliver actionable, drilled-down responses and can independently complete complex tasks on behalf of sellers with their permission.
Our Scale and Impact:
-
Expanded to 2.44MM sellers (45x growth vs. Dec 2024)
-
Currently serving 61% of active sellers worldwide across 9 international stores (CN2XX, IN, UK, DE, JP, BR, MX, AE, SA)
-
Supporting four languages: English, Chinese, German, and Japanese
-
2026 Goal: Scale to 90%+ active sellers WW with 5 new store launches (France, Italy, Spain, Canada, Australia)
As a Quality Assurance Specialist/AI Benchmarking Lead, you will benchmark Seller Assistant AI models for relevancy, correctness, and completeness. Your primary responsibilities include: 1) Evaluate audits performed by the core auditing team to increase confidence in evaluation metrics, 2) Improve audit reliability and consistency through systematic measurement of auditor accuracy,3) Conduct targeted calibration to ensure quality standards across the auditing function, 4) Enforce quality standards by quality-checking audits and providing actionable feedback to team members, 5) Drive continuous improvement in audit processes and methodologies. -
You conduct quality checks on audits performed by the core auditing team.
-
You identify rubric gaps and evaluation ambiguities that lead to inconsistent audit outcomes.
-
You surface high-confidence product issues earlier by validating and categorizing model failures.
-
You serve as point of contact for annotation tasks across ML data process areas, ensuring quality execution and delivery
-
You understand dependencies across ML data workflows and articulate customer impact effectively
-
You modify existing annotation methods and update SOPs.
-
You document SOP changes, secure approval, share knowledge with the team, and audit adoption and execution
-
You test new SOPs and tools, providing feedback on quality and improvement recommendations to support onboarding
-
Key job responsibilities
-
You structure data collection, analyse results and share inputs for SOP changes.
-
You collate, track, and report progress on key metrics agreed to with respective stakeholders (e.g., Program managers, Applied Scientist) specific to your functional area.
-
You identify operational issues related to process and tooling and recommend suggestions to improve key project metrics such as productivity and quality.
Basic Qualifications
- Bachelor's degree or equivalent
- Experience in natural language data labeling, data annotation, linguistic annotation or other forms of data markup
- Technical Skills: Proficiency in MS Excel; basic understanding of SQL and Python
- Experience with Microsoft Office products and applications
- Communication Skills: Strong verbal and written communication skills in English
- Knowledge about SOA and process that deal with sellers.
Preferred Qualifications
- 1 to 3 years of equivalent experience
- Performed annotation related tasks across ML data process areas.
- Strong knowledge of process documentation, analysis knowledge
- Technical proficiency in SQL querying and Python programming for data analysis
- Strong analytical and problem-solving skills
- Ability to work independently and as part of a team
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Applied AI ML Lead
JPMorgan Chase · Hyderabad, Telangana, India, IN

AI LEAD L1
Wipro · Hyderabad, India

Compliance Engineering - Neon Platform - AI/ML Engineer - Vice President - Hyderabad
Goldman Sachs · Hyderabad, Telangana, India

Applied AI ML Director
JPMorgan Chase · Hyderabad, India

Lead Engineer, Senior-Machine Learning Tools
Qualcomm · Hyderabad, Telangana, India
关于Amazon

Amazon
PublicAmazon.com, Inc. is an American multinational technology company engaged in e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence.
10,001+
员工数
Seattle
总部位置
$1.5T
企业估值
评价
3.4
10条评价
工作生活平衡
2.3
薪酬
4.2
企业文化
3.1
职业发展
3.8
管理层
2.7
65%
推荐给朋友
优点
Great benefits and competitive compensation
Learning opportunities and career advancement
Good teamwork and colleagues
缺点
High pressure and long hours
Poor work-life balance
Toxic work culture and high turnover
薪资范围
4个数据点
Junior/L3
L2
L3
L4
L5
L6
M3
M4
M5
M6
Mid/L4
Principal/L7
Senior/L5
Staff/L6
Director
Junior/L3 · Data Scientist L4
0份报告
$181,968
年薪总额
基本工资
-
股票
-
奖金
-
$154,672
$209,264
面试经验
6次面试
难度
4.0
/ 5
时长
21-35周
体验
正面 0%
中性 17%
负面 83%
面试流程
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Phone Screen
5
Technical Interview
6
Onsite/Virtual Interviews
常见问题
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
新闻动态
X-Energy’s Shares Jump in IPO, Delivering Wins to Amazon and Ken Griffin - WSJ
WSJ
News
·
Today
Amazon loses $150M after drones hit its data centers — and insurance won’t cover their losses. What it means for you - Yahoo Finance
Yahoo Finance
News
·
Today
Martha Stewart's new Amazon line has chic kitchen appliances from $40 - USA Today
USA Today
News
·
Today
‘Gen V’ Not Returning for Season 3 at Amazon - Variety
Variety
News
·
Today