Jobs
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Join NVIDIA, a groundbreaking leader in AI computing and visual technologies, at the forefront of innovation. As an AI in Industry Solution Architecture Intern, you'll be integral to our mission of redefining industries through AI and HPC. Our Solution Architect team builds innovative AI computing platforms, analyzes applications, and delivers outstanding value to our customers. This role offers a remarkable opportunity to harness NVIDIA's newest technologies to optimize large models, develop sophisticated AI workflows, and empower our clients with advanced AI solutions.
What you will be doing:
-
Provide technical support to internal developers and external customers, facilitating the adoption and implementation of NVIDIA technologies and products.
-
Apply your experience and knowledge in areas of accelerated computing and machine learning. Design and implement optimization of various AI models or business scenarios.
-
Setup model training or inference, identify the bottlenecks and verify the ways to improve model efficiency. Conduct surveys and experiments on learning models and to consolidate guidelines and relevant papers.
What we need to see:
-
Pursuing a Bachelor or Master in Computer Science, AI, or a related field; Or candidates pursuing a PhD in ML Infra or data systems for ML.
-
Can work under Linux, with strong programming skills in Python or C++.
-
Familiarity with AI models, including language models, video models, multi-modality models, or domain-specific models. Proficiency in at least one inference framework(e.g. TensorRT/TRT-LLM, ONNX Runtime, Py Torch, vLLM, SGLang, Dynamo).
-
Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
-
Demonstrated ability to collaborate effectively across diverse, global teams, adapting communication styles while maintaining clear, constructive professional interactions.
Ways to stand out from the crowd:
-
Optimizing critical operators such as GEMM and attention mechanisms tailored to different GPU architectures to improve inference performance.
-
Conducting in-depth research on Speech LLM training and implementing audio classification.
-
Aligning performance with benchmark data to evaluate the accuracy of current modeling, including KV-cache and multi-modality modeling.
-
Familiarity with mainstream inference engines (e.g., vLLM, SGLang), or familiarity with disaggregated LLM Inference.
-
Experience on SOTA RL for reasoning model methods and try to consolidate best practices and relevant papers.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Field Solutions Architect, Generative AI, Google Cloud (Japanese, English)
Google · placeTokyo, Japan

Field Solutions Architect, Applied AI
Google ·

Solution Architect - Total Rewards
Netflix · USA - Remote

Automation Solutions Architect, AI Accelerator Team
Meta · Seattle, WA

Partner Solutions Architect
OpenAI · San Francisco
About NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
Employees
Santa Clara
Headquarters
$4.57T
Valuation
Reviews
4.1
10 reviews
Work Life Balance
3.5
Compensation
4.2
Culture
4.3
Career
4.5
Management
4.0
75%
Recommend to a Friend
Pros
Great culture and supportive environment
Smart colleagues and excellent people
Cutting-edge technology and learning opportunities
Cons
Team-dependent experience and outcomes
Work-life balance issues with long hours
Politics and influence over competence
Salary Ranges
47 data points
Junior/L3
Mid/L4
Junior/L3 · Analyst
7 reports
$170,275
total / year
Base
$130,981
Stock
-
Bonus
-
$155,480
$234,166
Interview Experience
7 interviews
Difficulty
3.1
/ 5
Experience
Positive 0%
Neutral 86%
Negative 14%
Interview Process
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Interview
5
System Design Interview
6
Team Review
Common Questions
Coding/Algorithm
System Design
Technical Knowledge
Behavioral/STAR
News & Buzz
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
News
·
NaNw ago
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
News
·
NaNw ago
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
News
·
NaNw ago
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
News
·
NaNw ago