Hiring

High-Performance Networking Engineer - Supercomputing
Palo Alto, CA; San Francisco, CA · On-site · Full-time · 1mo ago
Compensation
$180,000 - $440,000
Benefits & Perks
•Competitive salary and equity package
•Comprehensive health, dental, and vision insurance
•Parental leave
•401(k) matching
•Team events and activities
Required Skills
Node.js
React
TypeScript
About xAI
xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
About the Role
As a High-Performance Networking Engineer on xAI’s Supercomputing team, you will design and optimize low-latency, high-bandwidth networking solutions using NVIDIA’s RDMA-capable technologies to support some of the world’s largest GPU supercomputing clusters. These clusters drive AI training and inference workloads, demanding cutting-edge performance and scalability.
Focus
- Develop and tune RDMA-based communication systems leveraging NVIDIA GPUs and Mellanox NICs (InfiniBand, RoCE) for ultra-fast data transfer between nodes.
- Implement and optimize GPUDirect RDMA to enable direct memory access between GPUs and network interfaces, minimizing CPU overhead.
- Integrate RDMA solutions with Kubernetes-based workloads, ensuring seamless operation across distributed compute and storage systems.
- Collaborate with AI researchers and infrastructure teams to accelerate data pipelines and collective communications using NCCL and MPI.
- Troubleshoot and resolve performance bottlenecks in high-throughput, low-latency networking environments.
Ideal Experience
- Hands-on experience with NVIDIA RDMA technologies (e.g., GPUDirect RDMA, RoCE, InfiniBand) in HPC or AI supercomputing environments.
- Proficiency in programming with Rust, C, or C++ for low-level networking and system optimization.
- Familiarity with NVIDIA’s networking stack, including Mellanox drivers, libraries (e.g., libibverbs), and tools (e.g., NVPeer Memory).
- Experience optimizing distributed systems with MPI, NCCL, or similar frameworks for GPU-accelerated workloads.
- Knowledge of Kubernetes networking and integrating RDMA into containerized environments.
- Bonus: Background in AI/ML training workflows and their networking demands (e.g., large-scale parameter synchronization).
Tech Stack
- NVIDIA GPUs and Mellanox networking (InfiniBand, RoCE)
- RDMA protocols (e.g., GPUDirect RDMA, RoCEv2)
- Kubernetes
- Rust and C/C++
- MPI (Message Passing Interface) and NCCL (NVIDIA Collective Communications Library)
Annual Salary Range
$180,000 - $440,000 USD
Benefits
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.
xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.
About xAI

xAI
Series B
X.AI Corp., doing business as xAI, is an American company working in the area of artificial intelligence (AI), social media and technology that is a wholly owned subsidiary of American aerospace company SpaceX.
Employees: 201-500
Headquarters: Austin
Valuation: $50B
Reviews
Overall: 4.1 (25 reviews)
Work Life Balance: 3.9
Compensation: 4.6
Culture: 4.3
Career: 4.3
Management: 3.5
Recommend to a Friend: 83%
Pros
Strong engineering culture with focus on code quality
Competitive compensation packages with equity
Flexible remote work options and good work-life balance
Cons
Organizational changes and restructuring can be disruptive
Internal politics in some teams
Fast-paced environment with tight deadlines
Salary Ranges
0 data points
Junior/L3 · Technical Writer (0 reports)
Total: $89,690 / year ($76,237 to $103,144)
Base: -
Stock: -
Bonus: -
Interview Experience
5 interviews
Difficulty: 3.0 / 5
Duration: 14-28 weeks
Interview Process
1. Coding Assessment
2. Live coding round
3. Technical Interview
4. Systems Design
Common Questions
Coding challenges
Technical problem solving
Systems design
Algorithm implementation
News & Buzz
Elon Musk’s SpaceX and xAI Are Planning a Megamerger of Rockets and AI - The Wall Street Journal
Source: The Wall Street Journal · News · 4w ago
SpaceX reportedly mulling Tesla merger or tie-up with Elon Musk’s xAI firm - The Guardian
Source: The Guardian · News · 5w ago
SpaceX and xAI could be merging. Why Elon Musk is doing it—and what might happen next - Fast Company
Source: Fast Company · News · 5w ago
Why Elon Musk would want to merge SpaceX with xAI or Tesla - Business Insider
Source: Business Insider · News · 5w ago