
Pioneering accelerated computing and AI
Senior Solutions Architect, Customer Success
NVIDIA is looking for a Senior Solutions Architect, Customer Success to join its NVIDIA Infrastructure Specialist Team. Academic and commercial organizations around the world are using NVIDIA products to redefine deep learning and data analytics, and to power next-generation data centers. Join the team building and advising on many of the largest and fastest AI/HPC systems in the world!
We are looking for someone who blends deep technical expertise with a consultative, collaborative approach. This role will engage directly with customers, partners, and multi-functional internal teams to assess infrastructure needs, architect scalable solutions, and guide the implementation of large-scale networking and AI infrastructure projects. The scope spans networking, system design, and automation—serving as a trusted strategic advisor and the technical face of NVIDIA to key accounts.
What You’ll Be Doing:
- Serve as a senior technical authority and trusted consultant on NVIDIA technologies, contributing to architecture reviews, guiding infrastructure decisions at scale, and providing strategic recommendations aligned with each customer’s business objectives.
- Establish and refine monitoring and optimization methodologies using analytics, telemetry, and automation to proactively detect bottlenecks, improve infrastructure resiliency, and drive continuous operational maturity.
- Lead and advise on the analysis, optimization, and performance tuning of complex GPU-accelerated systems and AI workloads, ensuring high availability and efficiency across customer data centers.
- Facilitate post-deployment reviews, incident retrospectives, and strategy sessions to shape the customer experience and deliver actionable insights into NVIDIA’s infrastructure roadmap.
- Own and lead complex technical projects end-to-end, from initial discovery and solution design through implementation, knowledge transfer, and continuous improvement, ensuring alignment to SLAs and proactive mitigation of technical risks.
- Support business growth by identifying AI infrastructure opportunities in cloud and enterprise environments, crafting compelling technical proposals, and driving initiatives that showcase NVIDIA’s leadership in this space.
What We Need to See:
- Education & Experience: BS/MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields, with 10+ years of professional experience in large-scale data center service operations with a focus on infrastructure.
- NVIDIA GPU Expertise: Demonstrated hands-on experience deploying, configuring, and optimizing NVIDIA GPU-accelerated infrastructure, including driver and firmware management, CUDA toolkit integration, and GPU workload profiling and troubleshooting.
- Customer Engagement: Track record of building long-term customer relationships and driving adoption through consultative engagement.
- Analytical & Problem-Solving Skills: Strong analytical and decision-making capabilities, with a demonstrable ability to identify root causes, drive continuous improvement, and deliver resilient technical solutions.
- System & Infrastructure Proficiency: Expertise in end-to-end data center architecture, spanning operating systems, Linux kernel drivers, GPU and NIC hardware, high-speed networking (InfiniBand, Ethernet, RDMA), and storage systems (Lustre, GPFS, NFS).
- Leadership & Communication: Good communication, time management, and organizational skills, with the ability to lead complex multi-functional projects, guide technical teams, and present to executive partners.
- Travel: Willingness to travel up to 25% for customer engagements.
Ways to Stand Out from the Crowd:
- Experience with Kubernetes for container orchestration, resource scheduling, and integration with GPU-accelerated workloads.
- Familiarity with observability stacks (Grafana, Prometheus, Loki) for monitoring, alerting, and building fault-tolerant systems.
- Experience with multi-tenant GPU cluster management and workload scheduling frameworks.
- Experience with NVIDIA Base Command Manager (BCM) for provisioning, managing, and monitoring GPU clusters at scale.
- Background with RDMA-based fabrics (InfiniBand or RoCE) in HPC or AI environments, as well as knowledge of CI/CD pipelines, Infrastructure-as-Code (Terraform, Ansible), and GitOps workflows for infrastructure automation.