招聘
必备技能
Python
Linux
We’re working on the next generation of recommendation tools and pushing the boundaries of accelerating model training and inference on GPU.
You’ll join a team of ML, HPC and Software Engineers and Applied Researcher developing a framework designed to make the productization of GPU-based recommender systems as simple and fast as possible.
What you'll be doing:In your role as Devtech Compute Engineer or CUDA Performance Engineer you will be primarily for the development of performance critical code of our deep learning applications with the goal of establishing world class performance for our customers.
This includes investigating the current performance and exploring optimization opportunities together with the global developers.
Important part of the work is that once optimal performance has been demonstrated that these solutions are integrated into our open source software libraries like ACCV-Lab, Recsys-Example .
With the knowledge to the requirements from customers and performance bottleneck, you will also work with our GPU, CPU, Network team to define the next generation hardware and software solutions.
Our coverage is wide including: LLM, Recsys, Robotic, Assisted Driving.
What we need to see:2+ years of experience of c++ code development in collaborative software development projects Skilled at writing CUDA kernels and optimizing code Basic knowledge of ML algorithms and deep learning Basic knowledge and understanding of mathematical topics including linear algebra, calculus and statistics Experience with algorithms and optimization Python and jupyter notebook for analysis, algorithm exploration and processing High standard for code quality and rigorous testing practices Conversational level English proficiency Some experience with Linux, openMP and MPIWay to stand out from the crowd:Experience in c++ HPC code development / PhD in related fields Able to perform in-depth performance analysis, can demonstrate to model the performance with mathematical and statistical considerations Linear algebra, calculus and statistics as second nature and this is reflected in your background of mathematics, physics, applied science or HPC related field Demonstrate the ability to write CUDA kernels with the purpose of utilizing the hardware to its full potential.
Write unit tests and validate the correctness of the optimizations as well as strive for and propose optimal solutions and ambitious goals, convince and help others to do the same#deeplearning
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Engineering Planner / Lufkin, TX Lufkin, Texas
Lockheed Martin · lufkin

Hardware Test Specialist, Failure Analysis (Gateway)
SpaceX · Redmond, WA

Principle Design Engineer
Northrop Grumman · United States-Utah-Magna

Data Center Controls Engineer, Deployment
Amazon · Fredericksburg, VA, USA

Forward Deployed Engineer - Data-as-a-Service
Snorkel AI · New York City, NY (Hybrid); Redwood City, CA (Hybrid); San Francisco, CA (Hybrid)
关于NVIDIA

NVIDIA
PublicA computing platform company operating at the intersection of graphics, HPC, and AI.
10,001+
员工数
Santa Clara
总部位置
$4.57T
企业估值
评价
4.4
10条评价
工作生活平衡
2.8
薪酬
4.2
企业文化
4.3
职业发展
4.1
管理层
3.8
78%
推荐给朋友
优点
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
缺点
Work-life balance challenges
High pressure and stress
Long hours required
薪资范围
67个数据点
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analyst
7份报告
$170,275
年薪总额
基本工资
$130,981
股票
-
奖金
-
$155,480
$234,166
面试经验
5次面试
难度
3.0
/ 5
面试流程
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
常见问题
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
新闻动态
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
News
·
NaNw ago
NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
News
·
NaNw ago
NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
News
·
NaNw ago
NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
News
·
NaNw ago