Deep Learning Solution Architect

NVIDIA

China · On-site · Full-time · Posted 2 weeks ago

NVIDIA is seeking dynamic Solution Architects with specialized expertise in training Large Language Models (LLMs), implementing RAG workflows, and agentic inference. You will leverage the full NVIDIA software and hardware ecosystem to design, optimize, and deliver production-grade generative AI solutions for enterprise customers. With competitive salaries and a generous benefits package, we are widely considered to be one of the world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous person with a real passion for technology, we want to hear from you.

What You Will Be Doing:

  • Architect end-to-end solutions focused on LLM pretraining, fine-tuning, high-performance inference, RAG workflows, and agentic inference orchestration using NVIDIA’s hardware and software platforms.

  • Collaborate with customers to understand their LLM-related business challenges and design tailored solutions aligned with the NVIDIA ecosystem.

  • Lead LLM training, distributed optimization, and performance tuning to achieve optimal throughput, latency, and memory efficiency.

  • Design and integrate RAG workflows and agentic inference pipelines into customer systems; provide technical guidance on best practices.

  • Collaborate with NVIDIA engineering teams to provide feedback and support pre-sales technical activities (workshops, demos).

What We Need to See:

  • Master’s / Ph.D. in Computer Science, Artificial Intelligence, or equivalent experience.

  • 4+ years of hands-on experience in AI, focusing on open-source LLM training, fine-tuning, and production inference optimization.

  • Deep understanding of mainstream LLM architectures and proficiency in LLM customization via PyTorch and Hugging Face Transformers.

  • Solid knowledge of GPU computing, cluster architecture, and distributed parallel training/inference for LLMs.

  • Competency in agentic inference design and using AI agents to solve business challenges.

  • Strong communication skills, with the ability to articulate complex technical concepts to technical and non-technical stakeholders.

Ways to Stand Out from the Crowd:

  • Hands-on experience with NVIDIA’s generative AI ecosystem (TRT-LLM, Megatron-LM, NVIDIA NeMo).

  • Advanced skills in LLM optimization (quantization, KV cache tuning, memory footprint reduction).

  • Experience with Docker and Kubernetes for containerized LLM and agent workflow deployment on-premises.

  • In-depth knowledge of multi-GPU parallelism and large-scale GPU cluster management.

#deeplearning

About NVIDIA

NVIDIA (Public)

A computing platform company operating at the intersection of graphics, HPC, and AI.

Employees: 10,001+

Headquarters: Santa Clara

Valuation: $4.57T

Reviews

Overall rating: 4.1 (10 reviews)

Work-life balance: 3.5

Compensation: 4.2

Culture: 4.3

Career: 4.5

Management: 4.0

Would recommend to a friend: 75%

Pros

  • Great culture and supportive environment

  • Smart colleagues and excellent people

  • Cutting-edge technology and learning opportunities

Cons

  • Team-dependent experience and outcomes

  • Work-life balance issues with long hours

  • Politics and influence over competence

Salary Information

73 data points

Levels: L3, L4, L5

L3 · Data Scientist IC2 (0 reports)

Total compensation: $177,542

Base salary: -

Stock: -

Bonus: -

Range: $150,910 to $204,174

Interview Experience

7 interviews

Difficulty: 3.1 / 5

Experience: Positive 0%, Neutral 86%, Negative 14%

Interview Process

  1. Application Review

  2. Recruiter Screen

  3. Online Assessment

  4. Technical Interview

  5. System Design Interview

  6. Team Review

Frequently Asked Questions

  • Coding/Algorithm

  • System Design

  • Technical Knowledge

  • Behavioral/STAR