採用

AI Benchmarking Lead, Performance Benchmarking Evaluation

Amazon

Hyderabad, TS, IND

On-site

Full-time

1w ago

Join our mission-critical team supporting Seller Assistant, Amazon's Gen-AI powered copilot that helps sellers navigate Amazon's complex ecosystem and grow their businesses. As a Quality Assurance Specialist, you'll play a pivotal role in ensuring the reliability and accuracy of AI model evaluations as we scale from 61% to 90%+ active seller coverage worldwide.

About Seller Assistant:

Seller Assistant is a conversational AI copilot that understands the full context of a seller's business. It intelligently orchestrates backend tools to deliver actionable, drilled-down responses and can independently complete complex tasks on behalf of sellers with their permission.

Our Scale and Impact:

Expanded to 2.44MM sellers (45x growth vs. Dec 2024)
Currently serving 61% of active sellers worldwide across 9 international stores (CN2XX, IN, UK, DE, JP, BR, MX, AE, SA)
Supporting four languages: English, Chinese, German, and Japanese
2026 Goal: Scale to 90%+ active sellers WW with 5 new store launches (France, Italy, Spain, Canada, Australia)
As a Quality Assurance Specialist/AI Benchmarking Lead, you will benchmark Seller Assistant AI models for relevancy, correctness, and completeness. Your primary responsibilities include: 1) Evaluate audits performed by the core auditing team to increase confidence in evaluation metrics, 2) Improve audit reliability and consistency through systematic measurement of auditor accuracy,3) Conduct targeted calibration to ensure quality standards across the auditing function, 4) Enforce quality standards by quality-checking audits and providing actionable feedback to team members, 5) Drive continuous improvement in audit processes and methodologies.
You conduct quality checks on audits performed by the core auditing team.
You identify rubric gaps and evaluation ambiguities that lead to inconsistent audit outcomes.
You surface high-confidence product issues earlier by validating and categorizing model failures.
You serve as point of contact for annotation tasks across ML data process areas, ensuring quality execution and delivery
You understand dependencies across ML data workflows and articulate customer impact effectively
You modify existing annotation methods and update SOPs.
You document SOP changes, secure approval, share knowledge with the team, and audit adoption and execution
You test new SOPs and tools, providing feedback on quality and improvement recommendations to support onboarding
Key job responsibilities
You structure data collection, analyse results and share inputs for SOP changes.
You collate, track, and report progress on key metrics agreed to with respective stakeholders (e.g., Program managers, Applied Scientist) specific to your functional area.
You identify operational issues related to process and tooling and recommend suggestions to improve key project metrics such as productivity and quality.

Basic Qualifications

Bachelor's degree or equivalent
Experience in natural language data labeling, data annotation, linguistic annotation or other forms of data markup
Technical Skills: Proficiency in MS Excel; basic understanding of SQL and Python
Experience with Microsoft Office products and applications
Communication Skills: Strong verbal and written communication skills in English
Knowledge about SOA and process that deal with sellers.

Preferred Qualifications

1 to 3 years of equivalent experience
Performed annotation related tasks across ML data process areas.
Strong knowledge of process documentation, analysis knowledge
Technical proficiency in SQL querying and Python programming for data analysis
Strong analytical and problem-solving skills
Ability to work independently and as part of a team

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

総閲覧数

応募クリック数

模擬応募者数

スクラップ

類似の求人

AI/ML Computational Science Manager

Accenture · Hyderabad

LEAD Software Developer C++ EDA with AI/ML

AMD · Hyderabad, India

Applied AI ML Lead

JPMorgan Chase · Hyderabad, Telangana, India, IN

Lead Engineer, Senior-Machine Learning Tools

Qualcomm · Hyderabad, Telangana, India

AI LEAD L1

Wipro · Hyderabad, India

Amazonについて

Amazon

Public

Amazon.com, Inc. is an American multinational technology company engaged in e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence.

10,001+

従業員数

Seattle

本社所在地

$1.5T

企業価値

レビュー

2.9

10件のレビュー

ワークライフバランス

2.8

報酬

3.7

企業文化

2.5

キャリア

2.3

経営陣

2.1

35%

友人に勧める

良い点

Good pay and compensation

Strong benefits package

Flexible scheduling options

改善点

Poor management and leadership

Limited growth and promotion opportunities

High stress and demanding work environment

給与レンジ

4件のデータ

Junior/L3

Mid/L4

Principal/L7

Senior/L5

Staff/L6

Director

Junior/L3 · Data Scientist L4

0件のレポート

$181,968

年収総額

基本給

ストック

ボーナス

$154,672

$209,264

面接体験

10件の面接

難易度

3.7

/ 5

期間

21-35週間

内定率

20%

体験

ポジティブ 10%

普通 10%

ネガティブ 80%

面接プロセス

Application Review

Recruiter Screen

Online Assessment

Technical Phone Screen

Onsite/Virtual Loop

Team Matching

Offer

よくある質問

Coding/Algorithm

System Design

Behavioral/STAR

Leadership Principles

Technical Knowledge

ニュース＆話題

Amazon vs. Walmart: This Isn't Even Close - The Motley Fool

The Motley Fool

News

3d ago

'Kevin' Review: Jason Schwartzman, Aubrey Plaza in Amazon Cat Cartoon - The Hollywood Reporter

The Hollywood Reporter

News

3d ago

Amazon's best weekend deals: Apple, Clinique, Yeti and more — save up to 70% - Yahoo

Yahoo

News

3d ago

Amazon Delivery Drones Involve a Perilous 10-Foot Drop. Users Are Posting the Apparent Results - Gizmodo

Gizmodo

News

3d ago