Hiring
Do you have a passion for computer vision and solving deep learning problems? The Video Engineering Data Analytics and Quality group is seeking an expert in evaluating machine learning and deep learning models, including foundation models and multimodal systems.
This role will play a critical part in crafting robust evaluation frameworks, using both traditional statistical methods and modern techniques like LLM-as-a-Judge! The ideal candidate combines strong analytical thinking, expertise in Python, and advanced knowledge of statistical methodologies and data quality standards.
This role involves collaboration with teams at Apple passionate about developing foundation models, including ML engineers, data scientists, and ML Infrastructure engineers to deliver amazing user experiences!
Description
Develop robust methodologies to assess the performance of foundation models (e.g., LLMs, vision-language models, etc.) across diverse tasks.
Leverage LLMs as judges to perform subjective and open-ended model evaluations (e.g., for summarization, reasoning, or multimodal generation tasks).
Build, curate, and lead evaluation datasets and benchmarks.
Advanced proficiency in at least one scripting language, preferably Python.
Collaborate with research, engineering, and product teams to define evaluation goals aligned with user experience and product quality.
Conduct failure analysis and uncover edge cases to improve model robustness.
Contribute to our tools and infrastructure to automate and scale evaluation processes.
Preferred Qualifications
Experience working with open-source evaluation tools like OpenEval, Elo-based ranking, or LLM-as-a-Judge frameworks.
Familiarity with prompt engineering, few-shot or zero-shot evaluation techniques.
Experience evaluating generative models (e.g., text generation, image generation).
Prior contributions to ML benchmarks or public evaluations.
Strong interpersonal skills.
Minimum Qualifications
BS and a minimum of 10 years of relevant industry experience.
Strong experience in evaluating supervised, unsupervised, and deep learning models.
Hands-on experience evaluating LLMs (e.g., GPT, Claude, PaLM) and using them as scoring/judging mechanisms.
Familiarity with multimodal models (e.g., image + text, video + audio) and related evaluation challenges.
Proficiency in Python and libraries such as NumPy, pandas, scikit-learn, PyTorch, or TensorFlow.
Solid understanding of statistical testing, sampling, confidence intervals, and metrics (e.g., precision/recall, BLEU, ROUGE, FID, etc.).
Strong documentation skills, including the ability to write technical reports and present to non-technical audiences.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Pay & Benefits
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.
Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses - including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.
Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.
About Apple
Apple Inc. is an American multinational technology company headquartered in Cupertino, California, in Silicon Valley, best known for its consumer electronics, software and online services.
Company type: Public
Employees: 10,001+
Headquarters: Cupertino
Valuation: $3.5T
Reviews
Overall rating: 3.9 (10 reviews)
Work-life balance: 2.5
Compensation: 4.2
Company culture: 3.8
Career development: 3.5
Management: 3.2
Would recommend to a friend: 72%
Pros
Great benefits and compensation
Talented colleagues and supportive teams
Learning opportunities and mentorship
Cons
Work-life balance challenges
High stress and pressure
Fast-paced environment
Salary Range
11,365 data points
Junior/L3 · Data Scientist ICT2
0 reports
Total annual compensation: $121,979
Base salary: -
Stock: -
Bonus: -
Range: $103,682 to $140,276
Interview Experience
3 interviews
Difficulty: 3.3 / 5
Duration: 28-42 weeks
Offer rate: 33%
Experience: Positive 33%, Neutral 0%, Negative 67%
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Onsite/Virtual Interviews
5. Team Matching
6. Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience