招聘
Required Skills
Python
PyTorch
Machine Learning
Inference Optimization
Diffusion Models
CUDA
Kubernetes
Docker
Generative AI Inference Engineer:
About the role:
We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inference for multi-modal models. A deep understanding of diffusion model architectures and familiarity with workflow tools like ComfyUI are a big plus. You will be expected to leverage and push the boundaries of state-of-the-art inference optimization techniques for multi-modal generative models. This role offers the opportunity to work alongside top researchers and engineers, utilizing cutting-edge high-performance computing resources to make a significant impact in the rapidly evolving field of generative AI.
Responsibilities:
-
Lead efforts to drive the design, development of customer-facing multi modal ML inference systems.
-
Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment.
-
Partner with leading cloud providers to deliver hosted Stability AI inference solutions.
-
Be a strategic thought partner for leaders across the organization on driving business impact through machine learning
-
Be part of the team to bring new Stability models and pipelines into existence
-
Prototype and productionize inference platform improvements and new features
Qualifications:
-
7+ years working on productionizing machine learning systems, including inference pipeline development
-
Expert level knowledge on writing and running python services at scale
-
5+ years working on python scientific stack, py Torch and at least one high-performance inference framework (e.g. Triton and TensorRT)
-
Deep understanding of Diffusion Architecture
-
Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight
-
Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV
-
Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure
-
Experience with Docker
-
Ability to rapidly prototype solutions and iterate on them with tight product deadlines
-
Strong communication, collaboration, and documentation skills
-
Experience with the open-source ML ecosystem (Hugging Face, W&B, etc.)
Equal Employment Opportunity:
We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Staff Software Engineer, Realtime Data
Tecton · Remote (US)

Software Engineer, AI Platform - New Grad
Nuro · Mountain View, California (HQ)

Senior Embedded Software Engineer (onsite)
Johnson Controls · 3 Locations

Applied Scientist / Research Engineer (Internship)
Mistral AI · Paris

Software Engineer, Onboard Systems
Nuro · Mountain View, California (HQ)
About Stability AI

Stability AI
Series AStability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion.
51-200
Employees
London
Headquarters
$1B
Valuation
Reviews
4.2
10 reviews
Work Life Balance
3.5
Compensation
4.9
Culture
4.4
Career
4.4
Management
3.7
92%
Recommend to a Friend
Pros
Strong research and publication culture
Working on cutting-edge AI/ML technologies
Top-tier compensation with excellent equity
Cons
Extremely fast-paced with constant changes
Work-life balance can suffer during critical periods
Ambiguity in rapidly evolving field
Salary Ranges
0 data points
Junior/L3
Junior/L3 · Recruiter
0 reports
$117,600
total / year
Base
$117,600
Stock
-
Bonus
-
$99,960
$135,240
Interview Experience
41 interviews
Difficulty
4.2
/ 5
Duration
21-35 weeks
Offer Rate
27%
Experience
Positive 70%
Neutral 12%
Negative 18%
Interview Process
1
Recruiter Screen
2
ML Coding
3
ML System Design
4
Research Discussion
5
Team Interviews
Common Questions
ML fundamentals
Design an ML system
Research paper discussion
Statistical concepts
News & Buzz
AOC warns we may be in a 'massive' AI bubble with '2008-style threats to economic stability'
·
15w ago
·
7,474
·
819
AOC warns we may be in a 'massive' AI bubble with '2008-style threats to economic stability'
·
15w ago
·
613
·
130
Warner Music Group and Stability AI Join Forces To Build The Next Generation Of Responsible AI Tools For Music Creation - Stability AI
Source: Stability AI
News
·
15w ago
AOC: The exposure to this (AI) industry and investment, I fear, has reached broad levels of the American economy. We could be facing 2008-style threats to economic stability..Should this bubble pop, we should not be entertaining a bailout of these corporations.
·
15w ago
·
7,123
·
135