refresh

트렌딩 기업

트렌딩

채용

JobsStability AI

Generative AI Inference Engineer

Stability AI

Generative AI Inference Engineer

Stability AI

United States

·

On-site

·

Full-time

·

1w ago

Required Skills

Python

PyTorch

Machine Learning

Inference Optimization

Diffusion Models

CUDA

Kubernetes

Docker

Generative AI Inference Engineer:

About the role:

We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative AI models. The ideal candidate will have substantial experience developing and running inference for multi-modal models. A deep understanding of diffusion model architectures and familiarity with workflow tools like ComfyUI are a big plus. You will be expected to leverage and push the boundaries of state-of-the-art inference optimization techniques for multi-modal generative models. This role offers the opportunity to work alongside top researchers and engineers, utilizing cutting-edge high-performance computing resources to make a significant impact in the rapidly evolving field of generative AI.

Responsibilities:

  • Lead efforts to drive the design, development of customer-facing multi modal ML inference systems.

  • Work with the Platform and Inference teams on building inference systems for the next generation of models, where you will work on areas such as optimization, model tuning and deployment.

  • Partner with leading cloud providers to deliver hosted Stability AI inference solutions.

  • Be a strategic thought partner for leaders across the organization on driving business impact through machine learning

  • Be part of the team to bring new Stability models and pipelines into existence

  • Prototype and productionize inference platform improvements and new features

Qualifications:

  • 7+ years working on productionizing machine learning systems, including inference pipeline development

  • Expert level knowledge on writing and running python services at scale

  • 5+ years working on python scientific stack, py Torch and at least one high-performance inference framework (e.g. Triton and TensorRT)

  • Deep understanding of Diffusion Architecture

  • Experience profiling and optimizing deep neural networks on Nvidia GPUs, using profiling tools such as NVIDIA Nsight

  • Experience with python-based image manipulation/encoding/decoding frameworks, such as OpenCV

  • Experience deploying to cloud orchestration systems such as Kubernetes and cloud providers such as AWS, GCP, and Azure

  • Experience with Docker

  • Ability to rapidly prototype solutions and iterate on them with tight product deadlines

  • Strong communication, collaboration, and documentation skills

  • Experience with the open-source ML ecosystem (Hugging Face, W&B, etc.)

Equal Employment Opportunity:

We are an equal opportunity employer and do not discriminate on the basis of race, religion, national origin, gender, sexual orientation, age, veteran status, disability or other legally protected statuses.

Total Views

0

Apply Clicks

0

Mock Applicants

0

Scraps

0

About Stability AI

Stability AI

Stability AI

Series A

Stability AI Ltd is a UK-based artificial intelligence company, best known for its text-to-image model Stable Diffusion.

51-200

Employees

London

Headquarters

$1B

Valuation

Reviews

4.2

10 reviews

Work Life Balance

3.5

Compensation

4.9

Culture

4.4

Career

4.4

Management

3.7

92%

Recommend to a Friend

Pros

Strong research and publication culture

Working on cutting-edge AI/ML technologies

Top-tier compensation with excellent equity

Cons

Extremely fast-paced with constant changes

Work-life balance can suffer during critical periods

Ambiguity in rapidly evolving field

Salary Ranges

0 data points

Junior/L3

Junior/L3 · Recruiter

0 reports

$117,600

total / year

Base

$117,600

Stock

-

Bonus

-

$99,960

$135,240

Interview Experience

41 interviews

Difficulty

4.2

/ 5

Duration

21-35 weeks

Offer Rate

27%

Experience

Positive 70%

Neutral 12%

Negative 18%

Interview Process

1

Recruiter Screen

2

ML Coding

3

ML System Design

4

Research Discussion

5

Team Interviews

Common Questions

ML fundamentals

Design an ML system

Research paper discussion

Statistical concepts