refresh

트렌딩 기업

트렌딩 기업

채용

채용NVIDIA

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

Senior Machine Learning Applications and Compiler Engineer

NVIDIA

2 Locations

·

On-site

·

Full-time

·

1mo ago

복지 및 혜택

Equity

필수 스킬

C++

Rust

LLVM

MLIR

TensorFlow

PyTorch

Compiler development

NVIDIA is seeking engineers to develop algorithms and optimizations for our inference and compiler stack. You will work at the intersection of large-scale systems, compilers, and deep learning, crafting how neural network workloads map onto future NVIDIA platforms. This is your chance to be part of something outstandingly innovative!

What you’ll be doing:

  • Build, develop, and maintain high-performance runtime and compiler components, focusing on end-to-end inference optimization.

  • Define and implement mappings of large-scale inference workloads onto NVIDIA’s systems.

  • Extend and integrate with NVIDIA’s SW ecosystem, contributing to libraries, tooling, and interfaces that enable seamless deployment of models across platforms.

  • Benchmark, profile, and monitor key performance and efficiency metrics to ensure the compiler generates efficient mappings of neural network graphs to our inference hardware.

  • Collaborate closely with hardware architects and design teams to feedback software observations, influence future architectures, and codesign features that unlock new performance and efficiency points.

  • Prototype and evaluate new compilation and runtime techniques, including graph transformations, scheduling strategies, and memory/layout optimizations tailored to spatial processors.

  • Publish and present technical work on novel compilation approaches for inference and related spatial accelerators at top tier ML, compiler, and computer architecture venues.

What we need to see:

  • MS or PhD in Computer Science, Electrical/Computer Engineering, or related field, or equivalent experience, with 5 years of relevant experience.

  • Strong software engineering background with proficiency in systems level programming (e.g., C/C++ and/or Rust) and solid CS fundamentals in data structures, algorithms, and concurrency.

  • Hands on experience with compiler or runtime development, including IR design, optimization passes, or code generation.

  • Experience with LLVM and/or MLIR, including building custom passes, dialects, or integrations.

  • Familiarity with deep learning frameworks such as Tensor Flow and Py Torch, and experience working with portable graph formats such as ONNX.

  • Solid understanding of parallel and heterogeneous compute architectures, such as GPUs, spatial accelerators, or other domain specific processors.

  • Strong analytical and debugging skills, with experience using profiling, tracing, and benchmarking tools to drive performance improvements.

  • Excellent communication and collaboration skills, with the ability to work across hardware, systems, and software teams.

  • Ideal candidates will have direct experience with MLIR based compilers or other multilevel IR stacks, especially in the context of graph based deep learning workloads.

Ways to stand out from the crowd:

  • Prior work on spatial or dataflow architectures, including static scheduling, pipeline parallelism, or tensor parallelism at scale.

  • Contributions to opensource ML frameworks, compilers, or runtime systems, particularly in areas related to performance or scalability.

  • Demonstrated research impact, such as publications or presentations at conferences like PLDI, CGO, ASPLOS, ISCA, MICRO, MLSys, NeurIPS, or similar.

  • Experience with large-scale AI distributed inference or training systems, including performance modeling and capacity planning for multi rack deployments.

총 조회수

1

총 지원 클릭 수

0

모의 지원자 수

0

스크랩

0

NVIDIA 소개

NVIDIA

NVIDIA

Public

A computing platform company operating at the intersection of graphics, HPC, and AI.

10,001+

직원 수

Santa Clara

본사 위치

$4.57T

기업 가치

리뷰

4.1

10개 리뷰

워라밸

3.5

보상

4.2

문화

4.3

커리어

4.5

경영진

4.0

75%

친구에게 추천

장점

Great culture and supportive environment

Smart colleagues and excellent people

Cutting-edge technology and learning opportunities

단점

Team-dependent experience and outcomes

Work-life balance issues with long hours

Politics and influence over competence

연봉 정보

73개 데이터

L3

L4

L5

L3 · Data Scientist IC2

0개 리포트

$177,542

총 연봉

기본급

-

주식

-

보너스

-

$150,910

$204,174

면접 경험

7개 면접

난이도

3.1

/ 5

경험

긍정 0%

보통 86%

부정 14%

면접 과정

1

Application Review

2

Recruiter Screen

3

Online Assessment

4

Technical Interview

5

System Design Interview

6

Team Review

자주 나오는 질문

Coding/Algorithm

System Design

Technical Knowledge

Behavioral/STAR