Benefits & Perks
• 401(k) matching
• Generous paid time off and holidays
• Flexible work arrangements
• Competitive salary and equity package
• Professional development budget
• Comprehensive health, dental, and vision insurance
Required Skills
TypeScript
JavaScript
PostgreSQL
About the Role
The Apple Silicon GPU software architecture team is seeking a senior/principal engineer to lead server-side ML acceleration and multi-node distribution initiatives. You will help define and shape the future GPU compute infrastructure on Private Cloud Compute that enables Apple Intelligence.
In this role, you will architect and build our next-generation distributed ML infrastructure, orchestrating massive network models across server clusters to power Apple Intelligence at scale. The work involves designing parallelization strategies that split models across many GPUs and optimizing every layer of the stack, from low-level memory access patterns to high-level distributed algorithms, to maximize hardware utilization while minimizing latency for real-time user experiences. You will work at the intersection of ML systems and hardware acceleration, collaborating directly with silicon architects to influence future GPU designs based on your understanding of inference workload characteristics, while building the production systems that will serve billions of requests daily.
This is a hands-on technical leadership position where you'll not only architect these systems but also dive deep into performance profiling, implement novel optimization techniques, and solve unprecedented scaling challenges as you help define the future of AI experiences delivered through Apple's secure cloud infrastructure.
Responsibilities
- Design and implement tensor/data/expert parallelism strategies for large language model inference across distributed server cluster environments
- Drive hardware and software roadmap decisions for ML acceleration
- Design architectures that achieve peak compute utilization and optimal memory throughput
- Develop and optimize distributed inference systems with focus on latency, throughput, and resource efficiency across multiple nodes
- Architect scalable ML serving infrastructure supporting dynamic model sharding, load balancing, and fault tolerance
- Collaborate with hardware teams on next-generation accelerator requirements and software teams on framework integration
- Lead performance analysis and optimization of ML workloads, identifying bottlenecks in compute, memory, and network subsystems
- Drive adoption of advanced parallelization techniques, including pipeline parallelism, expert parallelism, and other emerging approaches
Minimum Qualifications
- Strong knowledge of GPU programming (CUDA, ROCm) and high-performance computing
- Excellent system programming skills in C/C++; Python is a plus
- Deep understanding of distributed systems and parallel computing architectures
- Experience with inter-node communication technologies (InfiniBand, RDMA, NCCL) in the context of ML training/inference
- Understanding of how tensor frameworks (PyTorch, JAX, TensorFlow) are used in distributed training/inference
- Technical BS/MS degree
Preferred Qualifications
- Familiarity with the model development lifecycle, from trained model to large-scale production inference deployment
- Proven track record in ML infrastructure at scale
Equal Opportunity
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
About Apple

Apple
Public
A technology company that designs, manufactures, and markets consumer electronics, personal computers, and software.
10,001+
Employees
Cupertino
Headquarters
$3.5T
Valuation
Reviews
4.0
10 reviews
Work Life Balance
4.0
Compensation
4.2
Culture
3.8
Career
3.5
Management
3.2
75%
Recommend to a Friend
Pros
Great coworkers and people
Excellent benefits and perks
Fast-paced and engaging work environment
Cons
High expectations and pressure
Management quality varies
Limited career progression opportunities
Salary Ranges
17,968 data points
L2 · Industrial Designer L2
0 reports
$320,450 total / year
Base: $128,180
Stock: $160,225
Bonus: $32,045
$224,315 to $416,585
Interview Experience
5 interviews
Difficulty
3.4 / 5
Duration
28-42 weeks
Offer Rate
20%
Experience
Positive 20%
Neutral 40%
Negative 40%
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Behavioral Interview
5. Onsite/Virtual Interviews
6. Team Matching
7. Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Culture Fit