热门公司

招聘

职位NVIDIA

Senior QA and Automation Engineer

NVIDIA

Senior QA and Automation Engineer

NVIDIA

Israel, Raanana

·

On-site

·

Full-time

·

2d ago

We are looking for a Senior QA and Automation Engineer to join the NMX team.

NVIDIA NMX is an integrated platform for management, monitoring, and analytics of cloud telemetry in large-scale GPU and NVLink-based data centers. It includes NMX Telemetry (NMX‑T) for collecting and aggregating telemetry, NMX Controller (NMX‑C) for configuring and controlling network and fabric components, and NMX Manager (NMX‑M) for analytics, health monitoring, and policy-driven automation. As part of the NMX QA group, you will own the quality of a distributed, cloud-scale management and telemetry platform that sits at the heart of NVIDIA’s next-generation AI data centers.

What you'll be doing:

  • Design, develop, and execute end-to-end tests for new NMX features as part of GA and maintenance releases.

  • Plan and implement test automation for APIs, services, and data pipelines (REST/gRPC, telemetry collection, control-plane flows) including test infrastructure and reusable libraries.

  • Build and maintain regression suites for functional, performance, scale, and resiliency scenarios across NVOS-based switches, GPU systems, and cloud environments.

  • Integrate and validate with 3rd‑party and platform components, such as Linux, NVOS/NVLink switches, networking stacks, containers and orchestration environments.

  • Investigate complex issues across multiple services: reproduce bugs, analyze logs and telemetry, collaborate closely with development and architecture teams to isolate root causes, and verify fixes.

  • Contribute to observability of the product (metrics, logs, health checks, dashboards) to improve testability, debuggability, and production-readiness.

What we need to see:

  • Practical / B.A. / B.Sc. in Computer Science, Electrical Engineering, or equivalent experience.

  • 5+ years of hands-on QA / test automation experience in backend, distributed, or networking systems.

  • Strong programming/scripting skills, 5+ years with at least one of: Python (preferred), Bash, or similar for automation and tooling.

  • Solid networking and system background (3+ years): TCP/IP, L2/L3, data center networking and/or fabric technologies.

  • Strong Linux fundamentals (shell, processes, networking, system debugging).

  • Proven ability to work independently and end-to-end: from test design through automation and execution to reporting.

  • Excellent communication and interpersonal skills, comfortable working with multi-site R&D and architecture teams.

Ways to stand out from the crowd:

  • Experience with telemetry / monitoring / observability platforms (e.g., Prometheus, Grafana, Open Telemetry, Kafka, time-series databases).

  • Experience in HPC or large-scale AI/data center environments, or with fabric management solutions.

  • Proven experience designing automation infrastructure (frameworks, reusable libraries, CI integration) for distributed systems.

  • Hands-on experience with containers and orchestration (Docker, Kubernetes, Nomad, Consul) and CI/CD pipelines.

  • Familiarity with NVLink / Infini Band / high-speed networking concepts.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

总浏览量

0

申请点击数

0

模拟申请者数

0

收藏

0

关于NVIDIA

NVIDIA

NVIDIA

Public

A computing platform company operating at the intersection of graphics, HPC, and AI.

10,001+

员工数

Santa Clara

总部位置

$4.57T

企业估值

评价

4.1

10条评价

工作生活平衡

3.5

薪酬

4.2

企业文化

4.3

职业发展

4.5

管理层

4.0

75%

推荐给朋友

优点

Great culture and supportive environment

Smart colleagues and excellent people

Cutting-edge technology and learning opportunities

缺点

Team-dependent experience and outcomes

Work-life balance issues with long hours

Politics and influence over competence

薪资范围

73个数据点

Junior/L3

Mid/L4

Junior/L3 · Analyst

7份报告

$170,275

年薪总额

基本工资

$130,981

股票

-

奖金

-

$155,480

$234,166

面试经验

7次面试

难度

3.1

/ 5

体验

正面 0%

中性 86%

负面 14%

面试流程

1

Application Review

2

Recruiter Screen

3

Online Assessment

4

Technical Interview

5

System Design Interview

6

Team Review

常见问题

Coding/Algorithm

System Design

Technical Knowledge

Behavioral/STAR