
Pioneering accelerated computing and AI
Senior Manager, DGX Cloud Technical Program Management at NVIDIA
About the role
For over 25 years, NVIDIA has led the world in visual computing and accelerated computing. Today, we’re crafting the future of AI by driving breakthroughs in generative models, autonomous systems, and large-scale research. The DGX Cloud organization builds and operates the AI infrastructure that makes this innovation possible. We are seeking a Technical Program Management Manager to lead core infrastructure programs across DGX Cloud, including network, storage, trust services, security, break/fix operations, and telemetry. This role manages a team of TPMs responsible for bringing structure, operational rigor, and cross-functional alignment to infrastructure programs that keep DGX Cloud resilient, scalable, and customer-ready.
What you’ll be doing:
-
Lead and nurture a team of Technical Program Managers engaged in DGX Cloud core infrastructure projects.
-
Propel progress across network, storage, trust services, security programs, telemetry, and break/fix operational workstreams.
-
Partner with engineering, product, operations, security, and cloud provider teams to define priorities, achievements, dependencies, and delivery plans.
-
Build clear operating rhythms for infrastructure planning, managing blocking issues, risk tracking, and cross-functional decision-making.
-
Improve access to infrastructure health, delivery status, blockers, and program risks through practical metrics, dashboards, and reporting.
-
Coordinate break/fix and operational readiness programs that improve reliability, response time, and customer impact management.
-
Support continuous improvement across TPM practices, helping the team standardize planning, execution, and communication across DGX Cloud infrastructure.
What we need to see:
-
More than 12 overall years in technical program management, infrastructure program management, or similar roles, including upwards of 3 years directing or supervising TPMs.
-
Experience managing infrastructure programs in domains such as networking, storage, security, trust services, observability, telemetry, or cloud operations.
-
Strong ability to manage priorities, dependencies, risks, and execution plans across multiple engineering teams.
-
Experience building TPM operating rhythms, including status reviews, paths for handling blocking issues, tracking critical achievements, and leadership-ready updates.
-
Working knowledge of cloud infrastructure, distributed systems, or large-scale platform operations.
-
Strong communication skills with the ability to translate complex infrastructure work into clear program status, risks, and decisions.
-
Bachelor’s or Master’s degree in Computer Science, Engineering, or related field, or equivalent experience.
Ways to stand out from the crowd:
-
Experience supporting infrastructure for AI/ML platforms, GPU clusters, or large-scale cloud services.
-
Background with observability and telemetry tools such as Grafana, Prometheus, or similar platforms.
-
Experience with security, trust, compliance, or reliability programs in cloud infrastructure environments.
-
Track record improving operational processes for break/fix, incident response, or infrastructure readiness.
-
Strong technical judgment and ability to partner closely with engineering leaders while developing TPM talent.
Join NVIDIA and help us build the future of AI infrastructure with your expertise and passion!
#Li-Hybrid
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 240,000 USD - 379,500 USD.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until May 8, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
Required skills
People management
Technical program management
Cloud infrastructure
Risk management
Operational reporting
Cross-functional leadership
Planning
Metrics
Total Views
0
Total Apply Clicks
0
Total Mock Apply
0
Total Bookmarks
0
More open roles at NVIDIA

Senior Software Engineer - GPU Networking
NVIDIA · US, CA, Santa Clara

Senior System Software Test Engineer, Networking
NVIDIA · US, CA, Santa Clara

Manager, Networking Software Test
NVIDIA · US, CA, Santa Clara

Senior Firmware Engineer, Networking
NVIDIA · US, CA, Santa Clara

Senior Software K8S Engineer
NVIDIA · 5 Locations
Similar jobs

Agile and Digital Business Integration Lead (Onsite)
RTX (Raytheon) · US-CT-EAST HARTFORD-ETC ~ 400 Main St ~ BLDG ETC

Portfolio Engineering Demand Focal (hybrid)
RTX (Raytheon) · US-VA-DULLES-710 ~ 22110 Pacific Blvd ~ BLDG 10

Portfolio Engineering Excellence Focal (hybrid)
RTX (Raytheon) · US-VA-DULLES-710 ~ 22110 Pacific Blvd ~ BLDG 10

MRAD Program Integration
RTX (Raytheon) · US-MA-ANDOVER-AN0 ~ 366 Lowell St ~ BLDG AN0

Senior Principal Software Systems Engineer Product Owner / Release Train Engineer (Onsite)
RTX (Raytheon) · US-TX-RICHARDSON-C17 ~ 1717 Cityline Dr ~ CITYLINE C17
About NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
Employees
Santa Clara
Headquarters
$4.57T
Valuation
Reviews
10 reviews
4.4
10 reviews
Work-life balance
2.8
Compensation
4.5
Culture
4.2
Career
4.3
Management
3.8
78%
Recommend to a friend
Pros
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
Cons
High pressure and expectations
Poor work-life balance and long hours
Fast-paced environment leading to burnout
Salary Ranges
79 data points
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analyst
7 reports
$170,275
total per year
Base
$130,981
Stock
-
Bonus
-
$155,480
$234,166
Interview experience
5 interviews
Difficulty
3.0
/ 5
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
Latest updates
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
reddit/blind
·
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
reddit/blind
·
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
reddit/blind
·
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
reddit/blind
·