Hiring
NVIDIA is looking for an outstanding Linux Systems Administrator to join a leading network verification and infrastructure automation team. The team develops and maintains a wide range of infrastructure solutions, including an internal cloud provisioning platform on both VMs and bare-metal servers, driver verification environments, automated build and test systems, and more. You will have the opportunity to impact engineering teams by ensuring our NVIDIA/Mellanox networking hardware is provisioned, tested, and validated at scale. Your responsibilities will include using cutting-edge automation technologies and AI-based solutions to develop infrastructure capabilities spanning our internal cloud grid, provisioning pipelines, kernel and driver verification environments, and high-performance networking setups.

We are looking for a motivated teammate who isn't afraid to learn new technologies, tackle complicated debugging problems, work closely with internal R&D teams, and develop modern tools that continually improve our server fleet and automation infrastructure.
What you'll be doing:
- Join our infrastructure team and develop best-in-class automation solutions for bare-metal server provisioning and network driver verification.
- Build and maintain Ansible playbooks and roles for full server lifecycle management — from OS installation and kernel configuration to OFED driver setup and production readiness.
- Develop Python solutions for hardware introspection, REST API integration, inventory management, and resource allocation across the server fleet.
- Develop virtualization and system capabilities with KVM, QEMU, libvirt, Vagrant, Docker, and Kubernetes across a variety of operating systems and hardware architectures (x86_64, aarch64, ppc64le).
- Build and maintain Jenkins CI/CD pipelines (Groovy/Jenkinsfile) that orchestrate the full provisioning workflow from BIOS configuration through Ansible provisioning to automated validation.
- Be a part of an experienced team with a great atmosphere.
- Collaborate with multiple cross-domain teams — verification engineers, hardware teams, and cloud engineers — to provide the best infrastructure solutions to our customers.
What we need to see:
- B.Sc. (or equivalent experience) in Computer Engineering, Computer Science, or a related technical field.
- 5+ years of experience in the field of Linux systems administration, infrastructure automation, or DevOps.
- Background in designing, implementing, and debugging automation software. Strong debugging and analytical skills.
- Experience in Python — scripting, REST API clients, subprocess management, and pip package management.
- Solid understanding of Linux — systemd, package management (dnf/yum, apt, zypper), kernel parameters, GRUB, sysctl tuning, NFS, and service management.
- Agility and multitasking.
- Strong collaboration and communication skills with peers and internal customers.
Ways to stand out from the crowd:
- Experience with Ansible (playbooks, roles, tags, idempotency) and infrastructure-as-code principles as well as background with Kubernetes, Vagrant (vagrant-libvirt), Docker, and KVM/QEMU/libvirt virtualization stacks.
- Familiarity with NVIDIA/Mellanox hardware: ConnectX NIC series, BlueField DPUs, MFT (Mellanox Firmware Tools), and RSHIM driver configuration.
- Hands-on experience with hardware management APIs such as Redfish (Dell iDRAC / HP iLO) and IPMI for automated BIOS and BMC configuration.
- Experience with performance tuning (hugepages, DPDK, NUMA, CPU pinning) for virtualization and high-performance networking workloads.
About NVIDIA

NVIDIA (public): A computing platform company operating at the intersection of graphics, HPC, and AI.
Employees: 10,001+
Headquarters: Santa Clara
Valuation: $4.57T
Reviews
Rating: 4.1 (10 reviews)
Work-life balance: 3.5
Compensation: 4.2
Culture: 4.3
Career growth: 4.5
Management: 4.0
Would recommend to a friend: 75%
Pros
Great culture and supportive environment
Smart colleagues and excellent people
Cutting-edge technology and learning opportunities
Cons
Team-dependent experience and outcomes
Work-life balance issues with long hours
Politics and influence over competence
Salary range
73 data points
Levels: Junior/L3, Mid/L4
Junior/L3 · Analyst (7 reports)
Total annual compensation: $170,275
Base salary: $130,981
Stock: -
Bonus: -
Range: $155,480 to $234,166
Interview experience
7 interviews
Difficulty: 3.1 / 5
Experience: positive 0%, neutral 86%, negative 14%
Interview process
1. Application Review
2. Recruiter Screen
3. Online Assessment
4. Technical Interview
5. System Design Interview
6. Team Review
Common questions
Coding/Algorithm
System Design
Technical Knowledge
Behavioral/STAR
News

Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.

NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.

NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.

NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.