Member of Technical Staff - Senior ML Engineer - MAI Super Intelligence Team
Switzerland, Zürich, Zürich · On-site · Full-time · 3w ago
Required Skills: Machine Learning, Vision-Language Models, CUDA, HIP, Knowledge Distillation, Pruning, Quantization
Overview:
We are seeking a Senior Machine Learning Engineer to bridge the gap between advanced Vision-Language Model (VLM) research and high-performance production serving. Unlike standard data science and engineering roles, this position requires a dual competency: you must be able to design novel VLM architectures (including dataset curation and multilingual alignment) and to optimize the inference stack (kernel optimization, distillation, and memory management) so that these models run within the constraints of specific hardware (NVIDIA H100 and AMD MI300X).
The successful candidate will own the entire vertical slice, from reading the latest arXiv papers and improving training sets to writing the C++/CUDA kernels that serve the final model in production.
Responsibilities
1. VLM Research & Architecture Design:
Continuously evaluate and implement the latest research trends in Vision-Language Models, with a focus on Referring Expression Comprehension (REC), Document Understanding (e.g., Pix2Struct), and Visual Question Answering (VQA).
Design and build massive-scale training and evaluation datasets, ensuring multilingual compatibility and broad visual understanding for European market requirements.
Lead the model co-design process, creating architectures that are natively optimized for accelerator capabilities (compute-bound vs. memory-bound operations).
2. Advanced Inference Optimization & Serving:
Architect high-throughput serving layers using SGLang and vLLM, optimizing for non-standard decoding strategies.
Design and run experiments to map the Pareto frontier between serving latency and generation quality.
Apply Knowledge Distillation (KD), unstructured pruning, and quantization to fit large-scale VLM architectures onto single-node GPU setups (specifically H100 or MI300X) without compromising model quality.
3. Systems Engineering & Kernel Development:
Write and optimize custom kernels (CUDA/HIP) to reduce serving latency, identifying bottlenecks at the operator level.
Manage the full pre-training and post-training tech stack, ensuring seamless integration between model weights and inference engines.
Take ownership of landing the serving-efficient model in a production environment, ensuring reliability and scalability.
Qualifications
Mandatory Requirements (Must Have)
- Education: Master’s or PhD in Computer Science, Artificial Intelligence, or High-Performance Computing.
- Experience: 4+ years of experience in Machine Learning, with a mandatory split focus between Model Architecture and Systems Optimization.
- VLM Expertise: Proven experience building and shipping Vision-Language Models (e.g., architectures similar to CLIP, Flamingo, Pix2Struct). Must have experience creating custom evaluation sets for tasks like Document Understanding.
- Serving Stack Proficiency: Expert-level knowledge of SGLang and vLLM for optimized serving.
- Hardware Specifics: Demonstrable experience optimizing models for both NVIDIA (H100) and AMD (MI300X) accelerators.
- Optimization Techniques: Hands-on experience with Knowledge Distillation and Pruning to reduce model latency for target serving sizes.
- Production Engineering: A track record of taking complex multi-modal models from research code to a deployed, user-facing production product.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
About Microsoft
Reviews: 3.8 (5 reviews)
Work Life Balance: 4.1
Compensation: 4.3
Culture: 3.4
Career: 3.2
Management: 3.0
Recommend to a Friend: 65%
Pros
Excellent compensation and benefits package
Four-day workweek with improved work-life balance
Supportive managers and teams
Cons
High-pressure environment causing anxiety
Unprofessional interview processes
Limited creative work opportunities
Salary Ranges (5,571 data points)
Levels: Mid/L4, Principal/L7, Senior/L5, Staff/L6, Director
Mid/L4 · Data and Applied Scientist (0 reports)
Total: $202,099 / year
Base: $149,342
Stock: $32,252
Bonus: $20,505
Range: $139,572 to $301,212
Interview Experience (7 interviews)
Difficulty: 3.7 / 5
Duration: 14-28 weeks
Offer Rate: 14%
Experience: Positive 14%, Neutral 29%, Negative 57%
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Technical Interview
5. Onsite/Virtual Interviews
6. Final Round
7. Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience