
Pioneering accelerated computing and AI
Senior HPC Software Engineer
Required skills
Python
AWS
Kubernetes
Go
Ruby
Linux
GCP
Azure
Join our team as a Senior DevOps Engineer. At NVIDIA, you'll be part of the team shaping the future of computing and ensuring the smooth operation of our newest technologies. Our mission is to leverage the power of AI to build pioneering solutions that have a significant impact on the world.
What you'll be doing:
- Own the solutions you build, collaborating with cross-functional teams to successfully implement them.
- Collaborate with various teams in a fast-paced environment to ensure seamless project completion.
- Continuously improve solution provisioning and management through automation.
- Detect performance issues and recommend solutions to maintain world-class service quality.
- Conduct capacity management and planning to meet ongoing operational needs.
- Participate in incident reviews, assist in root cause identification, and write RCA reports.
- Deliver SRE solutions in a globally distributed, multi-cloud hybrid environment: AWS, GCP, and on-prem.
- Participate in the team's on-call rotation.
What we need to see:
- B.S. degree in Computer Science or a related technical field (or equivalent experience).
- 8+ years of experience building and supporting critical services.
- 5+ years of coding/scripting experience in at least two high-level programming languages such as Python, Go, Ruby, or Groovy.
- Proficiency in Kubernetes administration, modern CI/CD techniques, and Infrastructure as Code (IaC).
- Full-stack AI experience with deep expertise in MCP ecosystems, Carpenter, n8n orchestration, and AI-assisted development via Cursor.
- Expertise with at least one major cloud service provider: AWS, GCP, or Azure.
- Demonstrated proficiency with end-to-end SRE capabilities and observability.
- Proficient in monitoring, metrics gathering, APM, container management, and log collection tools.
- Creative problem solver with excellent debugging skills and great communication and documentation abilities.
Ways to stand out from the crowd:
- Linux certification from a well-known vendor (Red Hat, Oracle, etc.).
- Prior experience managing large-scale Kubernetes deployments in production.
- Strong skills in modern container networking and storage architecture.
- Hands-on background working with FLEXlm and license management systems.
- Hands-on experience working with Slurm/LSF environments.
About NVIDIA

NVIDIA
Public
A computing platform company operating at the intersection of graphics, HPC, and AI.
Employees: 10,001+
Headquarters: Santa Clara
Valuation: $4.57T
Reviews
Overall rating: 4.4 (10 reviews)
Work-life balance: 2.8
Compensation: 4.5
Company culture: 4.2
Career growth: 4.3
Management: 3.8
Recommendation rate: 78%
Pros
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
Cons
High pressure and expectations
Poor work-life balance and long hours
Fast-paced environment leading to burnout
Salary range
79 data points
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analyst
7 reports
Total annual compensation: $170,275
Base salary: $130,981
Stock: -
Bonus: -
$155,480
$234,166
Interview reviews
5 reviews
Difficulty: 3.0 / 5
Interview process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Onsite/Virtual Interviews
5. Team Matching
6. Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience