採用

Deep Learning Algorithms Engineer - ACOT

NVIDIA

Vietnam

On-site

Full-time

2w ago

NVIDIA is seeking a motivated AI Acceleration & Optimization Engineer to join our Acceleration Computing, Optimization and Tools (ACOT) team. In this role, you will help improve the performance, scalability, and efficiency of modern AI models across NVIDIA GPU platforms. You will work with engineers across algorithms, systems, and hardware to support high-performance model deployment and development for real-world AI workloads.

As part of ACOT, you will collaborate with architecture, research, CUDA, compiler, and framework teams to help bring next-generation AI workloads from research to production with strong performance and reliability.

What you will be doing

Assist in optimizing AI models such as LLMs, VLMs, diffusion models, and multimodal models for inference and training on NVIDIA GPUs.
Profile workloads and help identify performance bottlenecks across GPU compute, memory, networking, and storage.
Support the development and integration of optimization techniques such as quantization, kernel fusion, parallelism, and memory efficiency improvements.
Use tools including CUDA, TensorRT, Nsight, and NVIDIA acceleration libraries to analyze and improve model performance.
Work with deep learning frameworks including Py Torch, JAX, and Tensor Flow, as well as open-source inference frameworks like vLLM and SGLang.
Contribute to performance benchmarking, testing, and internal tooling to improve optimization workflows.
Partner with senior engineers and multi-functional teams to evaluate workload behavior and support future performance improvements.

What we want to see

Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, Computer Engineering, or related field (or equivalent experience).
2–4 years of experience, or strong academic/project experience, in deep learning, performance engineering, systems, or high-performance computing.
Good understanding of deep learning fundamentals and modern AI model architectures, especially transformers.
Familiarity with GPU architecture and parallel computing concepts such as CUDA, kernels, memory hierarchy, and streams.
Exposure to profiling and performance analysis tools.
Programming skills in Python.
Experience with at least one major ML framework such as Py Torch, Tensor Flow, or JAX.

Ways to stand out from the crowd

Internship, research, or project experience optimizing AI/ML workloads on GPUs.
Hands-on experience with TensorRT, TensorRT-LLM, vLLM, SGLang, or similar inference/runtime frameworks.
Familiarity with quantization, sparsity, or mixed-precision techniques.
Experience with distributed training or inference concepts. Contributions to open-source ML systems, performance tools, or infrastructure projects.
Proficiency in C++, strong debugging skills and interest in low-level performance optimization.

総閲覧数

応募クリック数

模擬応募者数

スクラップ

類似の求人

AI ENGINEER L1

Wipro · Pune, India

Software Engineer III, AI/ML, Google Cloud

Google ·

Machine Learning Engineer - Data Recommendation (Capcut)- San Jose

TikTok · San Jose, CA

Software Engineer III, AI/ML GenAI, GCP, Performance

Google ·

AI Deployment Engineer, Startups

OpenAI · Singapore

NVIDIAについて

NVIDIA

Public

A computing platform company operating at the intersection of graphics, HPC, and AI.

10,001+

従業員数

Santa Clara

本社所在地

$4.57T

企業価値

レビュー

4.1

10件のレビュー

ワークライフバランス

3.5

報酬

4.2

企業文化

4.3

キャリア

4.5

経営陣

4.0

75%

友人に勧める

良い点

Great culture and supportive environment

Smart colleagues and excellent people

Cutting-edge technology and learning opportunities

改善点

Team-dependent experience and outcomes

Work-life balance issues with long hours

Politics and influence over competence

給与レンジ

73件のデータ

L3 · Data Scientist IC2

0件のレポート

$177,542

年収総額

基本給

ストック

ボーナス

$150,910

$204,174

面接体験

7件の面接

難易度

3.1

/ 5

体験

ポジティブ 0%

普通 86%

ネガティブ 14%

面接プロセス

Application Review

Recruiter Screen

Online Assessment

Technical Interview

System Design Interview

Team Review

よくある質問

Coding/Algorithm

System Design

Technical Knowledge

Behavioral/STAR

ニュース＆話題

Negotiating NVIDIA's Offer

Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.

News

NaNw ago

NVIDIA Company Reviews

WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.

News

NaNw ago

NVIDIA Interview Discussions

Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.

News

NaNw ago

NVIDIA Culture Discussions

Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.

News

NaNw ago