Jobs
NVIDIA’s accelerated computing platform is the foundation of modern HPC and AI.At the core of this platform are the CUDA Core Libraries. C++ and Python libraries that enable developers to write fast, reliable, and scalable GPU-accelerated software! We are hiring a full-time Software Engineer to work on the CUDA Core Libraries that power GPU computing for both C++ and Python developers. This includes projects such as CCCL (Thrust, CUB, libcudacxx), cuda-python, and numba-cuda. You will join the team building the foundational libraries, algorithms, and language/runtime infrastructure that make CUDA a speed-of-light experience for developers across deep learning, scientific computing, and data analytics!
What you’ll be doing:
-
Develop and implement CUDA Core Libraries in C++ and/or Python, including parallel algorithms and idiomatic language bindings for core CUDA functionality.
-
Compose, optimize, and evolve GPU algorithms and APIs, from high-level interfaces down to low-level performance tuning involving memory, parallelism, and synchronization.
-
Own features end-to-end: develop, implementation, testing, benchmarking, documentation, and long-term maintenance.
-
Improve developer experience across the stack: CI, tests, benchmarks, packaging, examples, and docs.
-
Collaborate with senior CUDA engineers in design reviews, code reviews, and open-source-style workflows.
-
Engage with real users through issues, performance investigations, and API feedback.
What we need to see:
-
BS, MS, or PhD in Computer Science, Computer Engineering, or a related field or equivalent experience.
-
Minimum of 8+ years of related development experience
-
Strong programming skills in C++, Python, or both, with proven interest in systems-level software (performance, memory, concurrency, API design).
-
Solid understanding of modern C++ (templates, generics, standard library) and/or Python library development and packaging.
-
Practical experience with parallel or heterogeneous programming (CUDA, OpenMP, GPU-accelerated Python, or similar).
-
Experience contributing to production software or open-source libraries, including testing, profiling, and code review.
-
Ability to work independently, scope problems, and drive projects to completion.
-
Clear written communication for technical design and documentation.
-
Comfort navigating large, multi-language codebases (C++, Python, CMake, Pixi, CI systems).
Ways to stand out from the crowd:
-
Strong understanding of CPU/GPU architecture and how hardware details affect performance.
-
Hands-on experience with CUDA C++, CUDA Python, Py Torch, JAX, Numba, Cu Py, or similar GPU-accelerated stacks.
-
Familiarity with Thrust, CUB, libcudacxx, or other modern C++/GPU libraries.
-
Experience with compiler infrastructure or tooling (LLVM, Clang tooling, MLIR).
-
Demonstrated interest in developer tools, library design, and making other developers faster.
If you care deeply about performance, enjoy working at the C++/Python boundary, and want to shape the core CUDA libraries relied on by thousands of developers, this role is a direct fit.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

AI Data Security & Privacy Engineer
Rocket Lawyer · California

Senior Principal SWE, Rust Development
Zscaler · San Jose, California, USA

Salesforce Developer (Remote, IND)
CrowdStrike · India - Remote

Digital Product Configuration Management Engineer II
Raytheon (RTX) · US-AZ-TUCSON-805 ~ 1151 E Hermans Rd ~ BLDG 805

Staff Systems Software Engineer, Infrastructure Platform
General Motors · Austin, TX; Mountain View, CA; Warren, MI
About NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
Employees
Santa Clara
Headquarters
$4.57T
Valuation
Reviews
4.1
10 reviews
Work Life Balance
3.5
Compensation
4.2
Culture
4.3
Career
4.5
Management
4.0
75%
Recommend to a Friend
Pros
Great culture and supportive environment
Smart colleagues and excellent people
Cutting-edge technology and learning opportunities
Cons
Team-dependent experience and outcomes
Work-life balance issues with long hours
Politics and influence over competence
Salary Ranges
47 data points
Junior/L3
Mid/L4
Junior/L3 · Analyst
7 reports
$170,275
total / year
Base
$130,981
Stock
-
Bonus
-
$155,480
$234,166
Interview Experience
7 interviews
Difficulty
3.1
/ 5
Experience
Positive 0%
Neutral 86%
Negative 14%
Interview Process
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Interview
5
System Design Interview
6
Team Review
Common Questions
Coding/Algorithm
System Design
Technical Knowledge
Behavioral/STAR
News & Buzz
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
News
·
NaNw ago
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
News
·
NaNw ago
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
News
·
NaNw ago
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
News
·
NaNw ago