채용

Deep Learning Solution Architect

NVIDIA

China

On-site

Full-time

2w ago

NVIDIA are seeking dynamic Solution Architects with specialized expertise in training Large Language Models (LLMs), implementing RAG workflows, and agentic inference. You will leverage the full NVIDIA software & hardware ecosystem to design, optimize, and deliver production-grade generative AI solutions for enterprise customers. With competitive salaries and a generous benefits package, we are widely considered to be one of the world’s most desirable employers! We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous person with a real passion for technology, we want to hear from you.

What You Will Be Doing:

Architect end-to-end solutions focused on LLM pretraining, fine-tuning, high-performance inference, RAG workflows, and agentic inference orchestration using NVIDIA’s hardware and software platforms.
Collaborate with customers to understand their LLM-related business challenges and design tailored solutions aligned with the NVIDIA ecosystem.
Lead LLM training, distributed optimization, and performance tuning to achieve optimal throughput, latency, and memory efficiency.
Design and integrate RAG workflows and agentic inference pipelines into customer systems; provide technical guidance on best practices.
Collaborate with NVIDIA engineering teams to provide feedback and support pre-sales technical activities (workshops, demos).

What We Need to See:

Master’s / Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.
4+ years hands-on experience in AI, focusing on open-source LLM training, fine-tuning, and production inference optimization.
Deep understanding of mainstream LLM architectures and proficiency in LLM customization via Py Torch, Hugging Face Transformers.
Solid knowledge of GPU computing, cluster architecture, and distributed parallel training/inference for LLMs.
Competency in agentic inference design and using AI agents to solve business challenges.
Strong communication skills, able to articulate complex technical concepts to technical and non-technical stakeholders.

Ways to Stand Out from the Crowd:

Hands-on experience with NVIDIA’s generative AI ecosystem (TRT-LLM, Megatron-LM, NVIDIA Ne Mo).
Advanced skills in LLM optimization (quantization, KV Cache tuning, memory footprint reduction).
Experience with Docker, Kubernetes for containerized LLM and agent workflow deployment on-prem.
In-depth knowledge of multi-GPU parallelism and large-scale GPU cluster management.

#deeplearning

총 조회수

총 지원 클릭 수

모의 지원자 수

비슷한 채용공고

Software Engineer, Machine Learning Tooling

Waymo · Taipei, Taiwan; Hsinchu, Taiwan

AI/ML Scientist

Maersk · China, Shanghai, Shanghai, 200003

Software Engineer, Search, Ranking and Applied Machine Learning

Google ·

Applied Scientist 2

Microsoft · China, Beijing, Beijing; China, Jiangsu, Suzhou

Gen AI Engineer_Python

Infosys · Charlotte, NC

NVIDIA 소개

NVIDIA

Public

A computing platform company operating at the intersection of graphics, HPC, and AI.

10,001+

직원 수

Santa Clara

본사 위치

$4.57T

기업 가치

리뷰

4.1

10개 리뷰

워라밸

3.5

보상

4.2

문화

4.3

커리어

4.5

경영진

4.0

75%

친구에게 추천

장점

Great culture and supportive environment

Smart colleagues and excellent people

Cutting-edge technology and learning opportunities

단점

Team-dependent experience and outcomes

Work-life balance issues with long hours

Politics and influence over competence

연봉 정보

73개 데이터

L3 · Data Scientist IC2

0개 리포트

$177,542

총 연봉

기본급

주식

보너스

$150,910

$204,174

면접 경험

7개 면접

난이도

3.1

/ 5

경험

긍정 0%

보통 86%

부정 14%

면접 과정

Application Review

Recruiter Screen

Online Assessment

Technical Interview

System Design Interview

Team Review

자주 나오는 질문

Coding/Algorithm

System Design

Technical Knowledge

Behavioral/STAR

뉴스 & 버즈

Negotiating NVIDIA's Offer

Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.

News

NaNw ago

NVIDIA Company Reviews

WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.

News

NaNw ago

NVIDIA Interview Discussions

Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.

News

NaNw ago

NVIDIA Culture Discussions

Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.

News

NaNw ago