Hiring
Required skills
PyTorch
CUDA
GPU programming
We are looking for an AI Inference Engineer to join our growing team. Our current stack is Python, Rust, C++, PyTorch, Triton, CUDA, and Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities:
- Develop APIs for AI inference that will be used by both internal and external customers
- Benchmark and address bottlenecks throughout our inference stack
- Improve the reliability and observability of our systems and respond to system outages
- Explore novel research and implement LLM inference optimizations
Qualifications:
- Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
- Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization)
- Understanding of GPU architectures or experience with GPU kernel programming using CUDA
About Perplexity AI
Series B
Perplexity AI, Inc., or simply Perplexity, is an American privately held software company offering a web search engine that processes user queries and synthesizes responses.
Employees: 51-200
Headquarters: San Francisco
Valuation: $1B
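Reviews
Overall rating: 3.8 (10 reviews)
Work-life balance: 3.2
Compensation: 2.5
Company culture: 4.0
Career development: 2.5
Management: 2.8
Would recommend to a friend: 65%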
Pros
Supportive team and management
Good work-life balance and flexibility
Cutting-edge technology and interesting projects
Cons
Low compensation compared to industry standards
Poor management and lack of leadership direction
Fast-paced and overwhelming workload
Salary range
26 data points
Senior/L5 · MEMBER OF TECHNICAL STAFF AI RESEARCH ENGINEER (1 report)
Total annual compensation: $337,217
Base salary: $259,397
Stock: -
Bonus: -
Interview experience
1 interview
Difficulty: 4.0 / 5
Duration: 14-28 weeks
Experience: positive 0%, neutral 0%, negative 100%
Interview process
1. Application Review
2. HR Screen
3. Take-home Marketing Challenge
4. Hiring Manager Interview
5. Panel Interview
6. Offer
Common questions
Digital Marketing Strategy
Campaign Performance Analysis
Behavioral/STAR
Technical Marketing Knowledge
Case Study
News
Perplexity launches Personal Computer that brings AI agents Directly on your Mac - The Times of India · 3d ago
"Perplexity" Unveils a Broader Vision for the Role of Artificial Intelligence in Personal Computing - وكالة صدى نيوز · 3d ago
Perplexity AI Cheat Sheet: How an ‘Answer Engine’ Is Challenging Gemini, ChatGPT - eWeek · 4d ago
Perplexity priced me out of its OpenClaw clone - PCWorld · 4d ago