Jobs
Benefits & Perks
•Equity
Required Skills
Machine Learning
Deep Learning
CUDA
GPU Programming
Python
Rust
C++
We are looking for an AI Inference Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities:
- Develop APIs for AI inference that will be used by both internal and external customers
- Benchmark and address bottlenecks throughout our inference stack
- Improve the reliability and observability of our systems and respond to system outages
- Explore novel research and implement LLM inference optimizations
Qualifications:
- Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
- Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)
- Understanding of GPU architectures or experience with GPU kernel programming using CUDA
Final offer amounts are determined by multiple factors, including experience and expertise.
Equity: In addition to the base salary, equity may be part of the total compensation package.
About Perplexity AI

Perplexity AI
Series B
Perplexity AI, Inc., or simply Perplexity, is an American privately held software company offering a web search engine that processes user queries and synthesizes responses.
Employees: 51-200
Headquarters: San Francisco
Valuation: $1B
Reviews
4.0 (1 review)
Work Life Balance: 3.0
Compensation: 3.0
Culture: 3.0
Career: 3.5
Management: 3.0
Recommend to a Friend: 70%
Pros
Helpful tool for research and analysis
Useful for job application preparation
Effective for complex marketing challenges
Cons
Limited feedback provided
No specific criticisms mentioned
Insufficient detail on potential drawbacks
Salary Ranges
28 data points
Senior/L5 · Data Scientist (0 reports)
$791,025 total / year
$672,171 - $909,879
Base: -
Stock: -
Bonus: -
Interview Experience
1 interview
Difficulty: 4.0 / 5
Duration: 14-28 weeks
Experience: Positive 0%, Neutral 0%, Negative 100%
Interview Process
1. Application Review
2. HR Screen
3. Take-home Marketing Challenge
4. Hiring Manager Interview
5. Panel Interview
6. Offer
Common Questions
Digital Marketing Strategy
Campaign Performance Analysis
Behavioral/STAR
Technical Marketing Knowledge
Case Study
News & Buzz
After lawsuit, one of the biggest Amazon customers, Perplexity, signs $750 million deal with Microsoft, says 'AWS remains' - MSN
Source: MSN · News · 5w ago
Perplexity signs $750 million AI cloud deal with Microsoft - The American Bazaar
Source: The American Bazaar · News · 5w ago
Perplexity strikes Microsoft AI cloud deal amid Amazon legal fight - Cryptopolitan
Source: Cryptopolitan · News · 5w ago
Perplexity Inks Microsoft AI Cloud Deal Amid Dispute With Amazon - Bloomberg
Source: Bloomberg · News · 5w ago