
AI computing company
Senior AI Systems Performance Engineer
Required Skills
Python
PyTorch
TensorFlow
Machine Learning
The era of pervasive AI has arrived. In this era, organizations will use generative AI to unlock hidden value in their data, accelerate processes, reduce costs, drive efficiency and innovation to fundamentally transform their businesses and operations at scale.
SambaNova Suite™ is the first full-stack, generative AI platform, from chip to model, optimized for enterprise and government organizations. Powered by the intelligent SN40L chip, the SambaNova Suite is a fully integrated platform, delivered on-premises or in the cloud, combined with state-of-the-art open-source models that can be easily and securely fine-tuned using customer data for greater accuracy. Once adapted with customer data, customers retain model ownership in perpetuity, so they can turn generative AI into one of their most valuable assets.
About the role
We are seeking a talented and driven ML performance engineer to optimize and scale state-of-the-art foundation models on SambaNova's reconfigurable dataflow platform. You'll work hands-on with some of the most advanced models in the world, such as DeepSeek R1, GPT OSS, and other frontier architectures, to push the limits of throughput, latency, and efficiency. In this role, you'll bridge the gap between deep learning and systems performance, collaborating across compiler, runtime, and hardware layers to deliver world-record performance for large-scale AI inference.
Responsibilities
- Bring up and optimize cutting-edge foundation models (e.g., DeepSeek, Llama, Qwen, and others) on the SambaNova platform through the SambaNova software stack.
- Profile and enhance model performance across compiler, runtime, and hardware layers to achieve SOTA throughput and latency.
- Collaborate with machine learning, compiler, runtime, and hardware teams to deliver co-designed, high-performance AI applications.
- Integrate the latest advances in model architecture, quantization, scheduling, and memory optimization from both academia and industry.
- Develop robust, scalable, and efficient end-to-end inference solutions aligned with customer needs.
- Identify performance bottlenecks and propose dataflow or scheduling optimizations for both single-node and distributed systems.
Basic Qualifications
- Bachelor's or higher degree in computer science, electrical engineering, or a related field (e.g., applied mathematics, physics, or statistics).
- 3+ years of experience in one or more of the following areas:
  - Deep learning model development and performance optimization
  - Compiler, runtime, or kernel-level optimization
  - Software–hardware co-design or systems performance tuning
- Proficiency in Python or C++, with strong foundations in algorithms, data structures, and numerical computing.
- Experience with at least one major ML framework: PyTorch, TensorFlow, or JAX.
- Demonstrated ability to analyze and optimize performance in real-world ML pipelines.
Preferred Qualifications
- Hands-on experience with LLM or multimodal model training and inference.
- Background in large-scale distributed training, continuous batching, and high-throughput inference systems.
- Familiarity with quantization, graph optimization, kernel fusion, and model partitioning.
- Experience with frameworks such as DeepSpeed, Megatron, vLLM, or TensorRT.
- Strong GPU programming skills (CUDA, Triton, or OpenCL); experience with cuDNN, cuBLAS, or similar libraries is a plus.
- Knowledge of memory hierarchy optimization, caching, and scheduling for large-scale model execution.
- A publication record or open-source contributions in ML systems or performance optimization are a plus.
Submission Guidelines
Please note that in order to be considered an applicant for any position at SambaNova Systems, you must submit an application form for each position for which you believe you are qualified.
EEO Policy
SambaNova Systems is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to age (40 and over), color, disability, gender identity, genetic information, marital status, military or veteran status, national origin/ancestry, race, religion, creed, sex (including pregnancy, childbirth, breastfeeding), sexual orientation, or any other applicable status protected by federal, state, or local laws.
Benefits Summary for US-Based, Full-Time Employment Positions
SambaNova offers a competitive total rewards package, including base salary, equity, and benefits. We cover 95% of the premium for employee medical insurance and 77% of the premium for dependents, and we offer a Health Savings Account (HSA) with employer contribution. We also offer Dental, Vision, Short/Long-Term Disability, Basic Life, Voluntary Life, and AD&D insurance plans, in addition to Flexible Spending Account (FSA) options such as Health Care, Limited Purpose, and Dependent Care. Our library of well-being benefits available to you and your dependents includes a full subscription to Headspace, Gympass+ membership with access to physical gyms, One Medical membership, counseling services with an Employee Assistance Program, and much more.
About SambaNova

SambaNova
Public
Intel Capital Corporation started off as the investment arm of Intel Corporation in 1991 and in January 2025, it spun off as a standalone investment fund.
Employees: 201-500
Headquarters: Santa Clara
Reviews
Overall rating: 4.3 (10 reviews)
Work-life balance: 3.8
Compensation: 4.2
Culture: 4.5
Career: 3.9
Management: 3.4
Would recommend to a friend: 78%
Pros
Supportive team and colleagues
Good benefits and competitive compensation
Flexible work arrangements and remote options
Cons
Heavy workload and overtime expectations
Fast-paced and high-pressure environment
Management direction and communication issues
Salary Information
35 data points
Staff/L6 · Principal Technical Writer (1 report)
Total compensation: $172,500
Base salary: $150,000
Stock: -
Bonus: -
Interview Reviews
1 review
Difficulty: 4.0 / 5
Duration: 14-28 weeks
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Onsite/Virtual Interviews
5. Team Matching
6. Offer
Frequently Asked Question Types
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Culture Fit
Recent News
Intel investing at least $100M into SambaNova should help AI push: Wedbush (MSN, 1w ago)
SambaNova and TEPCO Systems Partner to Deliver Energy-Efficient AI Infrastructure to Japan’s Power Sector (sambanova.ai, 2w ago)
SambaNova and Intel target agentic inference (Jon Peddie Research, 2w ago)
Intel and SambaNova introduce a hardware system combining GPUs, RDUs, and CPUs (MSN, 2w ago)