热门公司

招聘

职位NVIDIA

Systems Quality and Reliability Lead - LPU

NVIDIA

Systems Quality and Reliability Lead - LPU

NVIDIA

US, CA, Santa Clara

·

On-site

·

Full-time

·

1mo ago

必备技能

Python

Linux

We are seeking Lead Systems Quality and Reliability Engineer to join our LPU team!

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing. NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can tackle, and that matter to the world. This is our life’s work, to amplify human imagination and intelligence!

What you'll be doing:

You will own, build, and manage the RMA and FA debug and root-cause analysis for existing and new Nvidia AI/ML products. You will conduct tests, and root-cause analysis. Other responsibilities include:

  • Conduct and lead debug and root-cause analysis of field RMAs. Collaborate with Systems Engineers, Hardware engineers, Software engineers, and operations engineers as required

  • Scale root cause FA capabilities within your organization

  • Create FA result reports that align with standard 8D or similar process

  • Analyze RMA, FA and repair data. Identify trends and raise quality alerts when necessary. Drive resolution, containment, and mitigation plans for such quality alerts

  • Oversee hardware quality performance, monitoring field quality data and associated metrics including RMA rates, MTBF, and Reliability Ratio

  • Manage operational perf of FA at CMs, ensuring partner achieve key perf indicators including FA cycle times, fault duplication rates and fault isolation rates

  • Oversee the setup of new products into Failure Analysis operations

What we need to see:

  • BS/MS in EE, Physics or a related degree (or equivalent experience)

  • 8+ yrs of hands on systems test and/or validation engineering experience

  • Proven hands-on management and leadership experience

  • Competence using lab equipment such as oscilloscopes, logic analyzers, power analyzers etc.

  • Experience with enabling reliability tests such as HTOL and quality tests such as Burn in

  • Ideal candidate will have working knowledge of FA techniques and tools such as FIB, SEM, TDR, VNA and CSAM

  • Strong knowledge of Fault isolation techniques such as OBIRCH, DLS/LADA, LVP and LVI

  • Proficiency with high speed interfaces (Ser Des, PCIe, DDR)

  • Proficiency in Python, PERL, C++, or other languages on UNIX /Linux

  • Excellent knowledge of PCB card and system level test and debug as well as be able to manage factory floor partners (CMs) for RMA/FA activities

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the technology world’s most desirable employers. We welcome you join our team with some of the most hard-working people in the world working together to promote rapid growth. Are you passionate about becoming a part of a best-in-class team supporting the latest in GPU and AI technology?

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 168,000 USD - 264,500 USD for Level 4, and 196,000 USD - 310,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until March 2, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

总浏览量

0

申请点击数

0

模拟申请者数

0

收藏

0

关于NVIDIA

NVIDIA

NVIDIA

Public

A computing platform company operating at the intersection of graphics, HPC, and AI.

10,001+

员工数

Santa Clara

总部位置

$4.57T

企业估值

评价

4.1

10条评价

工作生活平衡

3.5

薪酬

4.2

企业文化

4.3

职业发展

4.5

管理层

4.0

75%

推荐给朋友

优点

Great culture and supportive environment

Smart colleagues and excellent people

Cutting-edge technology and learning opportunities

缺点

Team-dependent experience and outcomes

Work-life balance issues with long hours

Politics and influence over competence

薪资范围

73个数据点

Junior/L3

Mid/L4

Junior/L3 · Analyst

7份报告

$170,275

年薪总额

基本工资

$130,981

股票

-

奖金

-

$155,480

$234,166

面试经验

7次面试

难度

3.1

/ 5

体验

正面 0%

中性 86%

负面 14%

面试流程

1

Application Review

2

Recruiter Screen

3

Online Assessment

4

Technical Interview

5

System Design Interview

6

Team Review

常见问题

Coding/Algorithm

System Design

Technical Knowledge

Behavioral/STAR