
Pioneering accelerated computing and AI
Senior Software Engineer, Data Center Workloads – Infrastructure at NVIDIA
About the role
At NVIDIA, we are pioneers in innovation, transforming computer graphics, PC gaming, and accelerated computing for over 25 years. Our team is driven by powerful technology and outstanding people who expand the limits of what’s achievable. Now, we are unlocking the potential of AI to usher in the next era of computing.
As part of our engineering organization, you will play a key hands-on role in developing and executing software-driven characterization workflows on NVIDIA rack-scale systems. This role is focused on running AI workloads across the full stack to analyze, characterize, and optimize power, performance, and drive behavior at system level. This is an opportunity to work at the intersection of software, infrastructure, silicon, and large-scale AI platforms, with direct impact on next-generation NVIDIA systems.
What you’ll be doing:
-
Develop and run software tools, automation, and workloads to characterize power, performance, and drive behavior across NVIDIA rack-scale systems.
-
Execute AI and system-level workloads to stress and evaluate behavior across the stack, including GPUs, CPUs, networking, storage, firmware, drivers, and system software.
-
Build automated frameworks for data collection, telemetry, validation, correlation, and analysis of characterization results.
-
Investigate system behavior under different workloads and operating conditions to identify bottlenecks, anomalies, and optimization opportunities.
-
Work closely with hardware, firmware, driver, system software, performance, and validation teams to define characterization methodologies and debug cross-stack issues.
-
Support bring-up, validation, and readiness activities for new rack-scale platforms and AI infrastructure.
-
Create clear documentation, test flows, and repeatable processes to improve coverage, efficiency, and reproducibility.
What we need to see:
-
B.Sc. or M.Sc. in Computer Science, Electrical Engineering, or a related field.
-
5+ years of software engineering experience, preferably in system software, infrastructure, validation, or performance-focused environments.
-
Strong programming skills in Python and at least one system-level language such as C/C++.
-
Experience developing automation and test infrastructure for complex hardware/software systems.
-
Hands-on experience running, debugging, or optimizing AI, HPC, or large-scale system workloads.
-
Good understanding of system-level architecture, including interactions across hardware, firmware, drivers, operating systems, and application layers
-
Experience working in Linux environments and with scripting, telemetry, logging, and data analysis tools.
-
Strong debugging and problem-solving skills, with the ability to work across multiple engineering disciplines.
-
Good communication skills and the ability to drive technical work in a fast-paced, cross-functional environment.
Ways to stand out from the crowd:
-
Experience with NVIDIA platforms, GPU systems, or rack-scale AI infrastructure.
-
Background in power, thermal, performance, or storage/drive characterization.
-
Experience with workload automation, cluster orchestration, or lab infrastructure.
-
Familiarity with AI benchmarks, training/inference workloads, and system stress methodologies.
-
Experience in post-silicon validation, production testing, or system bring-up.
Required skills
Infrastructure engineering
Automation
Performance analysis
Telemetry
Systems software
Data collection
Workload characterization
Debugging
Total Views
0
Total Apply Clicks
0
Total Mock Apply
0
Total Bookmarks
0
More open roles at NVIDIA

Senior Software Engineer - GPU Networking
NVIDIA · US, CA, Santa Clara

Senior System Software Test Engineer, Networking
NVIDIA · US, CA, Santa Clara

Manager, Networking Software Test
NVIDIA · US, CA, Santa Clara

Senior Firmware Engineer, Networking
NVIDIA · US, CA, Santa Clara

Senior Software K8S Engineer
NVIDIA · 5 Locations
Similar jobs

Associate Director, DT Portfolio Architect - Production (Remote)
Collins Aerospace (RTX) · US-CT-REMOTE

Enterprise Classified Cloud Sr. Manager
Collins Aerospace (RTX) · US-TX-RICHARDSON-C17 ~ 1717 Cityline Dr ~ CITYLINE C17

Senior Principal Engineer, Infrastructure Platform Architect (Onsite)
Collins Aerospace (RTX) · US-TX-PLANO-465 ~ 465 Independence Pkwy ~ INDEPENDENCE

CDS Platform Services
RTX (Raytheon) · US-CO-AURORA-S78 ~ 16201 E Centretech Pkwy ~ BLDG S78

Facilities Engineer (Onsite)
RTX (Raytheon) · US-MD-ANNAPOLIS-906 ~ 2551 Riva Rd ~ BLDG 906
About NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
Employees
Santa Clara
Headquarters
$4.57T
Valuation
Reviews
10 reviews
4.4
10 reviews
Work-life balance
2.8
Compensation
4.5
Culture
4.2
Career
4.3
Management
3.8
78%
Recommend to a friend
Pros
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
Cons
High pressure and expectations
Poor work-life balance and long hours
Fast-paced environment leading to burnout
Salary Ranges
79 data points
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analyst
7 reports
$170,275
total per year
Base
$130,981
Stock
-
Bonus
-
$155,480
$234,166
Interview experience
5 interviews
Difficulty
3.0
/ 5
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
Latest updates
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
reddit/blind
·
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
reddit/blind
·
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
reddit/blind
·
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
reddit/blind
·