採用
NVIDIA are seeking dynamic Solution Architects with specialized expertise in training Large Language Models (LLMs), implementing RAG workflows, and agentic inference. You will leverage the full NVIDIA software & hardware ecosystem to design, optimize, and deliver production-grade generative AI solutions for enterprise customers. With competitive salaries and a generous benefits package, we are widely considered to be one of the world’s most desirable employers! We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous person with a real passion for technology, we want to hear from you.
What You Will Be Doing:
-
Architect end-to-end solutions focused on LLM pretraining, fine-tuning, high-performance inference, RAG workflows, and agentic inference orchestration using NVIDIA’s hardware and software platforms.
-
Collaborate with customers to understand their LLM-related business challenges and design tailored solutions aligned with the NVIDIA ecosystem.
-
Lead LLM training, distributed optimization, and performance tuning to achieve optimal throughput, latency, and memory efficiency.
-
Design and integrate RAG workflows and agentic inference pipelines into customer systems; provide technical guidance on best practices.
-
Collaborate with NVIDIA engineering teams to provide feedback and support pre-sales technical activities (workshops, demos).
What We Need to See:
-
Master’s / Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.
-
4+ years hands-on experience in AI, focusing on open-source LLM training, fine-tuning, and production inference optimization.
-
Deep understanding of mainstream LLM architectures and proficiency in LLM customization via Py Torch, Hugging Face Transformers.
-
Solid knowledge of GPU computing, cluster architecture, and distributed parallel training/inference for LLMs.
-
Competency in agentic inference design and using AI agents to solve business challenges.
-
Strong communication skills, able to articulate complex technical concepts to technical and non-technical stakeholders.
Ways to Stand Out from the Crowd:
-
Hands-on experience with NVIDIA’s generative AI ecosystem (TRT-LLM, Megatron-LM, NVIDIA Ne Mo).
-
Advanced skills in LLM optimization (quantization, KV Cache tuning, memory footprint reduction).
-
Experience with Docker, Kubernetes for containerized LLM and agent workflow deployment on-prem.
-
In-depth knowledge of multi-GPU parallelism and large-scale GPU cluster management.
#deeplearning
総閲覧数
0
応募クリック数
0
模擬応募者数
0
スクラップ
0
類似の求人

Software Engineer, Machine Learning Tooling
Waymo · Taipei, Taiwan; Hsinchu, Taiwan

AI/ML Scientist
Maersk · China, Shanghai, Shanghai, 200003

Software Engineer, Search, Ranking and Applied Machine Learning
Google ·

Applied Scientist 2
Microsoft · China, Beijing, Beijing; China, Jiangsu, Suzhou

Gen AI Engineer_Python
Infosys · Charlotte, NC
NVIDIAについて

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
従業員数
Santa Clara
本社所在地
$4.57T
企業価値
レビュー
4.1
10件のレビュー
ワークライフバランス
3.5
報酬
4.2
企業文化
4.3
キャリア
4.5
経営陣
4.0
75%
友人に勧める
良い点
Great culture and supportive environment
Smart colleagues and excellent people
Cutting-edge technology and learning opportunities
改善点
Team-dependent experience and outcomes
Work-life balance issues with long hours
Politics and influence over competence
給与レンジ
73件のデータ
L3
L4
L5
L3 · Data Scientist IC2
0件のレポート
$177,542
年収総額
基本給
-
ストック
-
ボーナス
-
$150,910
$204,174
面接体験
7件の面接
難易度
3.1
/ 5
体験
ポジティブ 0%
普通 86%
ネガティブ 14%
面接プロセス
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Interview
5
System Design Interview
6
Team Review
よくある質問
Coding/Algorithm
System Design
Technical Knowledge
Behavioral/STAR
ニュース&話題
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
News
·
NaNw ago
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
News
·
NaNw ago
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
News
·
NaNw ago
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
News
·
NaNw ago