Jobs
Benefits & Perks
•Healthcare
•401(k)
•Equity
•Learning Budget
•Healthcare
•401k
•Equity
•Learning
Required Skills
Python
Statistical analysis
Machine learning evaluation
Deep learning
CO Salary Range: USD 132,100.00 - 244,600.00 per year
Do you have a passion for computer vision and solving deep learning problems? The Video Engineering Data Analytics and Quality group is seeking an expert in evaluating machine learning and deep learning models, including foundation models and multimodal systems.
This role will play a critical part in crafting robust evaluation frameworks, using both traditional statistical methods and modern techniques like LLM-as-a-Judge! The ideal candidate combines strong analytical thinking, expertise in Python, and advanced knowledge of statistical methodologies and data quality standards.
This role involves collaboration with teams at Apple passionate about developing foundation models, including ML engineers, data scientists, and ML Infrastructure engineers to deliver amazing user experiences!
Description:
Develop robust methodologies to assess the performance of foundation models (e.g., LLMs, vision-language models, etc.) across diverse tasks.
Leverage LLMs as judges to perform subjective and open-ended model evaluations (e.g., for summarization, reasoning, or multimodal generation tasks).
Build, curate, and lead evaluation datasets and benchmarks.
Advanced proficiency in at least one scripting language, preferably Python.
Collaborate with research, engineering, and product teams to define evaluation goals aligned with user experience and product quality.
Conduct failure analysis and uncover edge cases to improve model robustness.
Contribute to our tools and infrastructure to automate and scale evaluation processes.
Preferred Qualifications:
Experience working with open-source evaluation tools like Open Eval, ELO-based ranking, or LLM-as-a-Judge frameworks.
Familiarity with prompt engineering, few-shot or zero-shot evaluation techniques.
Experience evaluating generative models (e.g., text generation, image generation).
Prior contributions to ML benchmarks or public evaluations.
Strong interpersonal skills.
Minimum Qualifications:
BS and a minimum of 3 years relevant industry experience
Strong experience in evaluating supervised, unsupervised, and deep learning models.
Hands-on experience evaluating LLMs and using them as scoring/judging mechanisms.
Familiarity with multimodal models (e.g., image + text, video + audio) and related evaluation challenges.
Proficiency in Python and libraries such as Num Py, pandas, scikit-learn, Py Torch, or Tensor Flow.
Solid understanding of statistical testing, sampling, confidence intervals, and metrics (e.g., precision/recall, BLEU, ROUGE, FID, etc.).
Strong documentation skills, including the ability to write technical reports and present to non-technical audiences.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant .
Pay & Benefits:
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $132,100 and $244,600, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Sr. Manager ASIC, Annapurna Labs - Cloud Scale Machine Learning Acceleration Team
Amazon · Austin, TX, USA

Lead AI/ML Engineer – Regulatory Reporting - Senior Vice President
Citigroup · MUMBAI, Mahārāshtra, India

SDE, MLA hardware/software co-design, Annapurna Labs Machine Learning Acceleration
Amazon · Austin, TX, USA

Director, Machine Learning Engineering
PayPal · New York City, New York, United States of America; Scottsdale, Arizona, United States of America; Chicago, Illinois, United States of America; San Jose, California, United States of America

Lead Software Engineer - Python, AIML Engineer
JPMorgan Chase · Mumbai, India
About Apple

Apple
PublicA technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.
10,001+
Employees
Cupertino
Headquarters
$3.5T
Valuation
Reviews
4.0
10 reviews
Work Life Balance
4.0
Compensation
4.2
Culture
3.8
Career
3.5
Management
3.2
75%
Recommend to a Friend
Pros
Great coworkers and people
Excellent benefits and perks
Fast-paced and engaging work environment
Cons
High expectations and pressure
Management quality varies
Limited career progression opportunities
Salary Ranges
17,968 data points
Junior/L3
L2
L3
L4
L5
L6
M3
M4
M5
M6
Principal/L7
Senior/L5
Staff/L6
Junior/L3 · Data Scientist ICT2
0 reports
$121,979
total / year
Base
-
Stock
-
Bonus
-
$103,682
$140,276
Interview Experience
5 interviews
Difficulty
3.4
/ 5
Duration
28-42 weeks
Offer Rate
20%
Experience
Positive 20%
Neutral 40%
Negative 40%
Interview Process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Behavioral Interview
5
Onsite/Virtual Interviews
6
Team Matching
7
Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Culture Fit
News & Buzz
Exclusive | First-ever Apple check signed by Steve Jobs sells for a whopping $2.4M at auction - New York Post
Source: New York Post
News
·
4w ago
Apple Stock Forecast: Trending Upgrade After Earnings Beat - TipRanks
Source: TipRanks
News
·
4w ago
Tim Cook Thinks He Has Identified Apple’s Next Big Growth Opportunity - inc.com
Source: inc.com
News
·
5w ago
Apple Gives Itself the Toughest Act to Follow - Bloomberg
Source: Bloomberg
News
·
5w ago