Senior Hardware Engineer - GPU & AI Infrastructure
San Mateo, CA, United States · On-site · Full-time · 1mo ago
Compensation
$242,100 - $293,800
Benefits
• Equity
• Healthcare
Required Skills
GPU Architecture
Hardware Engineering
Python
Linux
Debugging
PCIe
NVLink
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.
At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and we’re looking for amazing talent to help us get there.
A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.
As a member of the Infrastructure Foundation Hardware Engineering team, you will play a key role in enabling our mission to deliver a reliable, high-performing, and cost-efficient infrastructure that powers the world’s play. In this specialized role, you will be the technical lead for our GPU and AI accelerator ecosystem. You will be responsible for the full lifecycle of GPU hardware, from initial architectural evaluation and firmware qualification to large-scale fleet integration and performance tuning. You will ensure that Roblox’s massive-scale rendering and ML workloads run on the most optimized and stable hardware possible.
You Will:
- Architect & Prototype: Prototype next-generation GPU-accelerated hardware platforms, ensuring seamless integration between high-density compute nodes, high-speed interconnects (NVLink/PCIe Gen5/6), and system firmware.
- GPU Optimization: Drive the integration, performance testing, and debugging of GPUs in our fleet, focusing specifically on hardware-level optimizations, driver tuning, and thermal/power management.
- Validation & Certification: Develop and execute rigorous evaluation and stress-testing strategies for GPU-heavy server platforms to ensure they meet Roblox’s unique demands for real-time rendering and low-latency AI inference.
- Firmware & Systems: Lead firmware qualification (BIOS/BMC) and troubleshooting, implementing automation systems to manage GPU health and firmware updates.
- Vendor Collaboration: Provide technical guidance and deep-dive feedback to hardware vendors. Lead critical investigations into component-level failures, triaging issues across the hardware, driver, and kernel layers.
- Observability: Build and maintain advanced monitoring stacks (Grafana/Prometheus) to track GPU metrics like HBM utilization, thermal throttling events, and PCIe bandwidth saturation.
You Have:
- Education: BA/BS Degree in Electrical Engineering, Computer Engineering, or related field with equivalent practical experience.
- GPU Expertise: 5+ years of hardware engineering experience with a specific focus on GPU architecture (NVIDIA HGX/MGX platforms preferred), AI accelerators, or high-performance compute (HPC) systems.
- Deep Technical Knowledge: In-depth understanding of modern data center technologies, including PCIe fabric, NVLink, InfiniBand, and liquid cooling systems for high-TDP hardware.
- Testing Skills: Hands-on experience testing and validating CPU, Memory (HBM/DDR5), Storage (NVMe), and high-speed networking subsystems in a Linux environment.
- Programming: Proficiency in Python, Go, or C++ for developing hardware validation tools and automation scripts.
- Systemic Debugging: Expert-level skills in debugging complex server issues remotely, with the ability to analyze kernel logs, hardware registers, and bus-level captures.
You Are:
- A Problem Solver: Decisive and effective at tracking hardware issues from identification through to fleet-wide resolution.
- A Communicator: Excellent oral and written communication skills; able to translate complex hardware constraints into actionable insights for software teams.
- Collaborative: Strong interpersonal skills with the ability to lead cross-functional projects with Data Center Ops, SRE, and external vendors.
- Adaptable: Willing to travel occasionally to data centers or vendor sites to oversee hardware deployments or "first-of-a-kind" builds.
For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.
Annual Salary Range: $242,100 - $293,800 USD
Roles that are based in an office are onsite Tuesday, Wednesday, and Thursday, with optional presence on Monday and Friday (unless otherwise noted).
Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations to candidates with qualifying disabilities or religious beliefs during the recruiting process.
About Roblox

Roblox
Public · Reimagining the way people come together.
Employees: 1,001-5,000
Headquarters: San Mateo
Valuation: $16.4B
Reviews: 3.8 (38 reviews)
Work-life balance: 3.6
Compensation: 3.7
Culture: 4.0
Career: 3.7
Management: 3.8
Would recommend to a friend: 73%
Pros
Opportunity for career growth
Supportive team and management
Interesting projects and challenges
Cons
Internal communication could improve
Room for improvement in processes
Some organizational bureaucracy
Salary Information
33 data points
Junior/L3 · Data Science - New Grad
1 report
Total compensation: $280,000
Base salary: -
Stock: -
Bonus: -
Interview Experience
3 interviews
Difficulty: 3.0 / 5
Duration: 14-28 weeks
Experience: Positive 0% · Neutral 67% · Negative 33%
Interview Process
1. Application Review
2. Online Assessment
3. Technical Interview
4. Behavioral Interview
5. Onsite/Virtual Interviews
Common Questions
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
Problem Solving