
Pioneering accelerated computing and AI
Solution Architecture Intern, AI in Industry - 2026
必备技能
Python
Linux
PyTorch
Machine Learning
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
Join NVIDIA, a groundbreaking leader in AI computing and visual technologies, at the forefront of innovation. As an AI in Industry Solution Architecture Intern, you'll be integral to our mission of redefining industries through AI and HPC. Our Solution Architect team builds innovative AI computing platforms, analyzes applications, and delivers outstanding value to our customers. This role offers a remarkable opportunity to harness NVIDIA's newest technologies to optimize large models, develop sophisticated AI workflows, and empower our clients with advanced AI solutions.
What you will be doing:
-
Provide technical support to internal developers and external customers, facilitating the adoption and implementation of NVIDIA technologies and products.
-
Apply your experience and knowledge in areas of accelerated computing and machine learning. Design and implement optimization of various AI models or business scenarios.
-
Setup model training or inference, identify the bottlenecks and verify the ways to improve model efficiency. Conduct surveys and experiments on learning models and to consolidate guidelines and relevant papers.
What we need to see:
-
Pursuing a Bachelor or Master in Computer Science, AI, or a related field; Or candidates pursuing a PhD in ML Infra or data systems for ML.
-
Can work under Linux, with strong programming skills in Python or C++.
-
Familiarity with AI models, including language models, video models, multi-modality models, or domain-specific models. Proficiency in at least one inference framework(e.g. TensorRT/TRT-LLM, ONNX Runtime, Py Torch, vLLM, SGLang, Dynamo).
-
Excellent problem-solving skills and the ability to troubleshoot complex technical issues.
-
Demonstrated ability to collaborate effectively across diverse, global teams, adapting communication styles while maintaining clear, constructive professional interactions.
Ways to stand out from the crowd:
-
Optimizing critical operators such as GEMM and attention mechanisms tailored to different GPU architectures to improve inference performance.
-
Conducting in-depth research on Speech LLM training and implementing audio classification.
-
Aligning performance with benchmark data to evaluate the accuracy of current modeling, including KV-cache and multi-modality modeling.
-
Familiarity with mainstream inference engines (e.g., vLLM, SGLang), or familiarity with disaggregated LLM Inference.
-
Experience on SOTA RL for reasoning model methods and try to consolidate best practices and relevant papers.
浏览量
0
申请点击
0
Mock Apply
0
收藏
0
相似职位

Solutions Architect - Fraud Systems
Regions Financial · Hoover, Alabama, United States of America

Senior Solution Architect - Personalization Strategist
Contentful · Chicago, Illinois, United States

Part Time (30 Hours) Associate Banker, Northlake Blvd Branch, Palm Beach Gardens, FL Bilingual Spanish Required
JPMorgan Chase · Palm Beach Gardens, FL, United States, US

Cloud Computing Application Architect, Mid
Booz Allen Hamilton · Chantilly, VA

Solutions Architect Intern
Typeface · Palo Alto, CA
关于NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
员工数
Santa Clara
总部位置
$4.57T
企业估值
评价
10条评价
4.4
10条评价
工作生活平衡
2.8
薪酬
4.5
企业文化
4.2
职业发展
4.3
管理层
3.8
78%
推荐率
优点
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
缺点
High pressure and expectations
Poor work-life balance and long hours
Fast-paced environment leading to burnout
薪资范围
79个数据点
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analyst
7份报告
$170,275
年薪总额
基本工资
$130,981
股票
-
奖金
-
$155,480
$234,166
面试评价
5条评价
难度
3.0
/ 5
面试流程
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
常见问题
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience