招聘
We are looking for a Senior QA and Automation Engineer to join the NMX team.
NVIDIA NMX is an integrated platform for management, monitoring, and analytics of cloud telemetry in large-scale GPU and NVLink-based data centers. It includes NMX Telemetry (NMX‑T) for collecting and aggregating telemetry, NMX Controller (NMX‑C) for configuring and controlling network and fabric components, and NMX Manager (NMX‑M) for analytics, health monitoring, and policy-driven automation. As part of the NMX QA group, you will own the quality of a distributed, cloud-scale management and telemetry platform that sits at the heart of NVIDIA’s next-generation AI data centers.
What you'll be doing:
-
Design, develop, and execute end-to-end tests for new NMX features as part of GA and maintenance releases.
-
Plan and implement test automation for APIs, services, and data pipelines (REST/gRPC, telemetry collection, control-plane flows) including test infrastructure and reusable libraries.
-
Build and maintain regression suites for functional, performance, scale, and resiliency scenarios across NVOS-based switches, GPU systems, and cloud environments.
-
Integrate and validate with 3rd‑party and platform components, such as Linux, NVOS/NVLink switches, networking stacks, containers and orchestration environments.
-
Investigate complex issues across multiple services: reproduce bugs, analyze logs and telemetry, collaborate closely with development and architecture teams to isolate root causes, and verify fixes.
-
Contribute to observability of the product (metrics, logs, health checks, dashboards) to improve testability, debuggability, and production-readiness.
What we need to see:
-
Practical / B.A. / B.Sc. in Computer Science, Electrical Engineering, or equivalent experience.
-
5+ years of hands-on QA / test automation experience in backend, distributed, or networking systems.
-
Strong programming/scripting skills, 5+ years with at least one of: Python (preferred), Bash, or similar for automation and tooling.
-
Solid networking and system background (3+ years): TCP/IP, L2/L3, data center networking and/or fabric technologies.
-
Strong Linux fundamentals (shell, processes, networking, system debugging).
-
Proven ability to work independently and end-to-end: from test design through automation and execution to reporting.
-
Excellent communication and interpersonal skills, comfortable working with multi-site R&D and architecture teams.
Ways to stand out from the crowd:
-
Experience with telemetry / monitoring / observability platforms (e.g., Prometheus, Grafana, Open Telemetry, Kafka, time-series databases).
-
Experience in HPC or large-scale AI/data center environments, or with fabric management solutions.
-
Proven experience designing automation infrastructure (frameworks, reusable libraries, CI integration) for distributed systems.
-
Hands-on experience with containers and orchestration (Docker, Kubernetes, Nomad, Consul) and CI/CD pipelines.
-
Familiarity with NVLink / Infini Band / high-speed networking concepts.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Senior Software Engineer - SM Core
Redis · Israel

Software Engineer II and Senior Software Engineer- Microsoft Security (Multiple Roles)
Microsoft · Israel, Tel Aviv, Herzliya

Principal Software Engineer - Edge AI
Microsoft · Israel, Tel Aviv, Herzliya; Israel, Multiple Locations, Multiple Locations

Senior Software Engineer - HiredScore
Workday · Israel, Tel Aviv

Principal Application Software Engineer - Relocation to Tokyo
Wayve · Israel
关于NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
员工数
Santa Clara
总部位置
$4.57T
企业估值
评价
4.1
10条评价
工作生活平衡
3.5
薪酬
4.2
企业文化
4.3
职业发展
4.5
管理层
4.0
75%
推荐给朋友
优点
Great culture and supportive environment
Smart colleagues and excellent people
Cutting-edge technology and learning opportunities
缺点
Team-dependent experience and outcomes
Work-life balance issues with long hours
Politics and influence over competence
薪资范围
73个数据点
Junior/L3
Mid/L4
Junior/L3 · Analyst
7份报告
$170,275
年薪总额
基本工资
$130,981
股票
-
奖金
-
$155,480
$234,166
面试经验
7次面试
难度
3.1
/ 5
体验
正面 0%
中性 86%
负面 14%
面试流程
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Interview
5
System Design Interview
6
Team Review
常见问题
Coding/Algorithm
System Design
Technical Knowledge
Behavioral/STAR
新闻动态
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
News
·
NaNw ago
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
News
·
NaNw ago
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
News
·
NaNw ago
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
News
·
NaNw ago