
Pioneering accelerated computing and AI
SOC AI Application Engineer — AI Services, Agents and Knowledge Systems at NVIDIA
About the role
The NVIDIA System-on-Chip (SOC) Design team is looking for a top AI Engineer who is curious about SOC design automation, RTL integration, and chip build and assembly. If you are interested in using AI to upgrade the conventional SOC design flow, come and join us. We need you to be passionate about AI and hardware. You will help us build AI application-layer services that boost the HW execution team's efficiency, including assistants, retrieval and Q&A, workflow automation, and AI agents for SOC design-related tasks. You will ship and operate AI services (APIs, orchestration, RAG, evaluation), and evaluate and use modern frameworks and tools such as LangChain (and similar stacks), RAG pipelines, and coding-agent / IDE-centric workflows (e.g. Claude Code-class assistants, reusable skills / playbooks for agents).
What you’ll be doing:
- Design, implement, and operate LLM-backed services: APIs, async jobs, streaming responses, and integration with internal tools and data sources.
- Build RAG and knowledge systems: chunking, embeddings, vector retrieval, reranking, access control, and quality/latency tuning.
- Apply agent and orchestration patterns with frameworks like LangChain (or comparable): tool use, multi-step plans, memory, and guardrails, aligned with how the SOC hardware team works.
- Improve developer and engineer experience with AI-assisted coding and repeatable “skills”: prompts, procedures, and small utilities that teams can run consistently (including patterns like Claude Code + structured skills).
- Own reliability and evaluation: logging, tracing, regression tests for prompts/pipelines, and metrics for usefulness and safety on proprietary data.
- Work with hardware engineers from the Methodology, CAD, and Design teams to scope problems, propose solutions, implement them over multiple iterations, and bring production-ready features online.
What we need to see:
- MS/PhD in CS, CE, or EE.
- 2+ years of professional experience with a clear focus on AI application / AI service development (building products on top of LLMs, not only ad-hoc scripts).
- Strong Python and experience shipping services (REST/gRPC, containers, basic cloud or on-prem deployment patterns as applicable).
- Hands-on use of LLM application frameworks (e.g. LangChain or equivalent) and RAG (vector DBs, retrieval design, evaluation).
- Familiarity with coding agents and IDE workflows (e.g. Claude Code-style usage) and with agent frameworks (skills, templates, or internal “agent packs”).
- Solid software engineering habits: dependency management, configuration, testing, and clear interfaces for other teams.
- Excellent communication and ability to work with partners who are not AI specialists.
Ways to stand out from the crowd:
- Hardware knowledge: RTL coding, Makefile authoring, SOC design know-how, physical design know-how, and similar, enough to understand user context and data (no requirement to be a chip designer).
- Web development: lightweight UIs, internal portals, or full-stack slices (e.g. React/TypeScript, FastAPI + frontend) for AI features.
Required skills
LLM applications
RAG
API development
agent orchestration
workflow automation
evaluation
knowledge systems
About NVIDIA

NVIDIA
Public
A computing platform company operating at the intersection of graphics, HPC, and AI.
Employees: 10,001+
Headquarters: Santa Clara
Valuation: $4.57T
Reviews
4.4 (10 reviews)
Work-life balance: 2.8
Compensation: 4.5
Culture: 4.2
Career: 4.3
Management: 3.8
78% recommend to a friend
Pros
Cutting-edge technology and innovation
Excellent compensation and benefits
Great team culture and collaboration
Cons
High pressure and expectations
Poor work-life balance and long hours
Fast-paced environment leading to burnout
Salary Ranges
79 data points
L3 · Data Scientist IC2 (0 reports)
$177,542 total per year (range: $150,910 to $204,174)
Base: -
Stock: -
Bonus: -
Interview experience
5 interviews
Difficulty: 3.0 / 5
Interview process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Onsite/Virtual Interviews
5. Team Matching
6. Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
Latest updates
Negotiating NVIDIA's Offer
Base, stock, and sign-on negotiable. Recruiters invested in closing candidates. CEO reviews all 42K employee salaries monthly. Stock growth has made many employees millionaires.
reddit/blind

NVIDIA Company Reviews
WLB rated 3.9/5 (lowest category). 64% satisfied with WLB but 53% feel burnt out. Compensation rated 4.4-4.5/5. Experience highly team-dependent.
reddit/blind

NVIDIA Interview Discussions
Technical bar is high with 4-6 rounds. Process takes 4-8 weeks. Expect C++ questions, LeetCode medium, and system design. Difficulty rated 3.16/5.
reddit/blind

NVIDIA Culture Discussions
Team-dependent experience; sink-or-swim culture that rewards high performers but can be overwhelming. No politics, flat structure, but demanding workload with some teams requiring evening/weekend work.
reddit/blind