채용

Principal Engineer – Gen AI Platform Inferencing Engineering
CHARLOTTE; CONCORD; IRVING
·
On-site
·
Full-time
·
3d ago
About this role:
Wells Fargo is seeking a Principal Engineer – Gen AI Platform Inferencing Engineering to lead the development and optimization of our AI model serving and inferencing platforms within Digital Technology's AI Capability Engineering group.
This is a software engineering role — you'll write code, build systems, and solve hard problems in the AI inference stack. You'll work deep inside frameworks like vLLM, SGLang, and NVIDIA Dynamo, extending and optimizing them to serve models at enterprise scale. You'll also build the automation, tooling, and deployment infrastructure that connects these runtimes to Kubernetes-native serving layers like KServe, KNative, and Open Shift AI.
If you've contributed to inference frameworks, written custom serving logic, or built production ML serving pipelines in Python, we want to hear from you.
In this role, you will:
- Develop, extend, and optimize inference runtime configurations and integrations across vLLM, SGLang, NVIDIA Dynamo, TensorRT-LLM, and Triton
- Write Python-based tooling and automation for model onboarding, serving configuration, performance benchmarking, and deployment pipelines
- Build and maintain Kubernetes-native model serving infrastructure using KServe, KNative, and Open Shift AI — including custom serving runtimes and inference graphs
- Implement and tune inference performance optimizations — continuous batching, speculative decoding, prefix caching, concurrency control, autoscaling policies, and disaggregated prefill/decode pipelines
- Develop Helm charts, operators, and Kustomize overlays for deploying and managing inference workloads on Open Shift/OCP
- Integrate inference platforms with GPU workload orchestrators (Run:AI or similar) — automating project provisioning, quota management, and workload scheduling
- Build observability and testing harnesses — load testing frameworks, latency/throughput profiling scripts, and regression test suites for inference stack upgrades
- Partner with AI/ML teams to productionize new models, defining serving architectures, resource requirements, and SLA targets
Required Qualifications:
- 7+ years in software engineering or platform engineering (work experience, training, military experience, or education)
- 5+ years of programming experience in Python with experience building production systems
Desired Qualifications:
- Experience with Inference frameworks, such as vLLM, SGLang, NVIDIA Dynamo, TensorRT-LLM, or Triton Inference Server
- Experience with Kubernetes-native ML serving, such as KServe, KNative, Seldon, or Open Shift AI
- Experience with Inference optimization, (Continuous batching, speculative decoding, KV-cache management, prefix caching, quantization-aware serving (FP8, AWQ, GPTQ), or tensor parallelism configuration)
- Experience with Container platform development, (Writing Helm charts, operators, or custom controllers for Open Shift, GKE, or EKS)
- Experience with GPU workload orchestration, (Run:AI, Kueue, Volcano — scripting workload automation, quota management, or scheduler integrations)
- Experience with Performance and load testing, (Building benchmarking tools for token throughput, time-to-first-token, batch latency, and autoscaling behavior)
- Familiarity with NVIDIA GPU fundamentals (CUDA, MIG, NCCL), experience contributing to open-source inference projects, or background in ML observability tooling (Prometheus, Grafana, Arize)
Job Expectations:
- This position is not eligible for Visa sponsorship
- This position requires a hybrid in office work schedule
Pay Range
Reflected is the base pay range offered for this position. Pay may vary depending on factors including but not limited to demonstrated examples of prior performance, skills, experience, or work location. Employees may also be eligible for incentive opportunities.
$159,000.00 - $305,000.00
Benefits
-
Wells Fargo provides eligible employees with a comprehensive set of benefits, many of which are listed below. Visit [Benefits
-
Wells Fargo Jobs](https://www.wellsfargojobs.com/en/life-at-wells-fargo/benefits) for an overview of the following benefit plans and programs offered to employees.
-
Health benefits
-
401(k) Plan
-
Paid time off
-
Disability benefits
-
Life insurance, critical illness insurance, and accident insurance
-
Parental leave
-
Critical caregiving leave
-
Discounts and savings
-
Commuter benefits
-
Tuition reimbursement
-
Scholarships for dependent children
-
Adoption reimbursement
Posting End Date:
20 Apr 2026
Job posting may come down early due to volume of applicants.We Value Equal Opportunity
Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.
Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.
Applicants with Disabilities
To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.
Drug and Alcohol Policy
Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.
Wells Fargo Recruitment and Hiring Requirements:
a. Third-Party recordings are prohibited unless authorized by Wells Fargo.
b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.
총 조회수
0
총 지원 클릭 수
0
모의 지원자 수
0
스크랩
0
비슷한 채용공고

Senior Messaging Platform Engineer
Dell · Eldorado Do Sul, Brazil

Principal Software Engineer, ITC
Nike · Karnataka, India

Senior Group Technical Architect 3DEXP Platform, ENOVIA Fundamentals
HCL Technologies · Pune, India

Senior Tech Lead Mechatronics
Emerson · PUNE, MAHARASHTRA, India, IN

Principal Technical Strategist - Wallet, Payments, and Commerce Engineering
Apple · New York, NY
Wells Fargo 소개

Wells Fargo
PublicWells Fargo & Company is an American multinational financial services company. The company operates in 35 countries and serves more than 70 million customers worldwide.
10,001+
직원 수
San Francisco
본사 위치
$163B
기업 가치
리뷰
3.7
10개 리뷰
워라밸
3.8
보상
3.2
문화
3.9
커리어
2.8
경영진
3.1
65%
친구에게 추천
장점
Good benefits and health coverage
Flexible hours and remote work options
Good work-life balance
단점
Limited career advancement opportunities
High stress and fast-paced environment
Poor management and lack of direction
연봉 정보
15개 데이터
Mid/L4
Senior/L5
Mid/L4 · Lead Analytics Consultant
1개 리포트
$151,878
총 연봉
기본급
$116,064
주식
-
보너스
-
$151,878
$151,878
면접 경험
3개 면접
난이도
3.0
/ 5
소요 기간
21-35주
합격률
33%
면접 과정
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Interview
5
Behavioral Interview
6
Offer
자주 나오는 질문
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
Past Experience
뉴스 & 버즈
Nationwide class debated in Wells Fargo cash sweep case - Daily Journal
Daily Journal
News
·
3d ago
Greenville police investigate robbery at Wells Fargo on Red Banks Road - WCTI
WCTI
News
·
4d ago
UPDATE: One arrested in Greenville bank robbery - WITN
WITN
News
·
4d ago
The 'debasement trade' has driven gold to new heights. Wells Fargo's bull case calls for $8,000 an ounce - CNBC
CNBC
News
·
5d ago