
The Goldman Sachs Group, Inc
AI/ML Engineer – Vice President (Seattle, WA)
必备技能
Machine Learning
WHO WE ARE
Goldman Sachs is a leading global investment banking, securities and investment management firm that provides a wide range of services worldwide to a substantial and diversified client base that includes corporations, financial institutions, governments and high net-worth individuals.
Founded in 1869, it is one of the oldest and largest investment banking firms. The firm is headquartered in New York and maintains offices in London, Bangalore, Frankfurt, Tokyo, Hong Kong and other major financial centers around the world.
We are committed to growing our distinctive Culture and holding to our core values which always place our client's interests first. These values are reflected in our Business Principles, which emphasize integrity, commitment to excellence, innovation and teamwork.
**BUSINESS UNIT OVERVIEWEnterprise Technology Operations (ETO)**is a Business Unit within Core Engineering focused on running scalable production management services with a mandate of operational excellence and operational risk reduction achieved through large scale automation, best-in-class engineering, and application of data science and machine learning. The Production Runtime Experience (PRX) team in ETO applies software engineering and machine learning to production management services, processes, and activities to streamline monitoring, alerting, automation, and workflows.
TEAM OVERVIEW The Machine Learning and Artificial Intelligence team in PRX applies advanced ML and GenAI to reduce the risk and cost of operating the firm’s large-scale compute infrastructure and extensive application estate. Building on strengths in statistical modelling, anomaly detection, predictive modelling, and time-series forecasting, we leverage foundational LLM Models to orchestrate multi-agent systems for automated production management services. By unifying classical ML with agentic AI, we deliver reliable, explainable, and cost-efficient operations at scale.
ROLE AND RESPONSIBILITIES In this role, you will be responsible for launching and implementing GenAI agentic solutions aimed at reducing the risk and cost of managing large-scale production environments with varying complexities. You will address various production runtime challenges by developing agentic AI solutions that can diagnose, reason, and take actions in production environments to improve productivity and address issues related to production support.
What you’ll do:
-
Build agentic AI systems: Design and implement tool-calling agents that combine retrieval, structured reasoning, and secure action execution (function calling, change orchestration, policy enforcement) following MCP protocol. Engineer robust guardrails for safety, compliance, and least-privilege access.
-
Productionize LLMs: Build evaluation framework for open-source and foundational LLMs; implement retrieval pipelines, prompt synthesis, response validation, and self-correction loops tailored to production operations.
-
Integrate with runtime ecosystems: Connect agents to observability, incident management, and deployment systems to enable automated diagnostics, runbook execution, remediation, and post-incident summarization with full traceability.
-
Collaborate directly with users: Partner with production engineers, and application teams to translate production pain points into agentic AI roadmaps; define objective functions linked to reliability, risk reduction, and cost; and deliver auditable, business-aligned outcomes.
-
Safety, reliability, and governance: Build validator models, adversarial prompts, and policy checks into the stack; enforce deterministic fallbacks, circuit breakers, and rollback strategies; instrument continuous evaluations for usefulness, correctness, and risk.
-
Scale and performance: Optimize cost and latency via prompt engineering, context management, caching, model routing, and distillation; leverage batching, streaming, and parallel tool-calls to meet stringent SLOs under real-world load.
-
Build a RAG pipeline: Curate domain-knowledge; build data-quality validation framework; establish feedback loops and milestone framework maintain knowledge freshness.
-
Raise the bar: Drive design reviews, experiment rigor, and high-quality engineering practices; mentor peers on agent architectures, evaluation methodologies, and safe deployment patterns.
QUALIFICATIONS
A Bachelor’s degree (Masters/ PhD preferred) in a computational field (Computer Science, Applied Mathematics, Engineering, or in a related quantitative discipline), with 7+ years of experience as an applied data scientist / machine learning engineer.
ESSENTIAL SKILLS
-
7+ years of software development in one or more languages (Python, C/C++, Go, Java); strong hands-on experience building and maintaining large-scale Python applications preferred.
-
3+ years designing, architecting, testing, and launching production ML systems, including model deployment/serving, evaluation and monitoring, data processing pipelines, and model fine-tuning workflows.
-
Practical experience with Large Language Models (LLMs): API integration, prompt engineering, finetuning/adaptation, and building applications using RAG and tool-using agents (vector retrieval, function calling, secure tool execution).
-
Understanding of different LLMs, both commercial and open source, and their capabilities (e.g., OpenAI, Gemini, Llama, Qwen, Claude).
-
Solid grasp of applied statistics, core ML concepts, algorithms, and data structures to deliver efficient and reliable solutions.
-
Strong analytical problem-solving, ownership, and urgency; ability to communicate complex ideas simply and collaborate effectively across global teams with a focus on measurable business impact.
Preferred:
- Proficiency building and operating on cloud infrastructure (ideally AWS), including containerized services (ECS/EKS), serverless (Lambda), data services (S3, DynamoDB, Redshift), orchestration (Step Functions), model serving (Sage Maker), and infra-as-code (Terraform/CloudFormation).
Salary Range
The expected base salary for this Seattle, Washington United States-based position is $150,000-$250,000. In addition, you may be eligible for a discretionary bonus if you are an active employee as of fiscal year-end.
YOUR CAREER
Goldman Sachs is a meritocracy where you will be given all the tools to advance your career. At Goldman Sachs, you will have access to excellent training programs designed to improve multiple facets of your skill portfolio. Our in-house training program, “Goldman Sachs University” offers a comprehensive series of courses that you will have access to as your career progresses. Goldman Sachs University has an impressive catalogue of courses which span technical, business and leadership skills.
浏览量
0
申请点击
0
Mock Apply
0
收藏
0
相似职位

Applied Science Manager GenAI, CreativeX, Amazon Advertising
Amazon · Seattle, WA, USA

Machine Learning Scientist - Natural Language Processing (NLP) - Vice President - Machine Learning Center of Excellence
JPMorgan Chase · Seattle, WA, United States, US

Director, Applied AI Acceleration (SWE)
Walt Disney · Seattle, WA, USA

Applied Science Manager, Sponsored Products and Brands
Amazon · Seattle, WA, USA

Lead Machine Learning Engineer, Ads Research
Hulu (Disney) · 3 Locations
关于Goldman Sachs

Goldman Sachs
PublicThe Goldman Sachs Group, Inc. is an American multinational investment bank and financial services company. Founded in 1869, Goldman Sachs is headquartered in the Battery Park City neighborhood of Manhattan in New York City, with regional offices in many international financial centers.
45,000+
员工数
Lower Manhattan
总部位置
$80B
企业估值
评价
2条评价
2.9
2条评价
工作生活平衡
2.5
薪酬
3.0
企业文化
2.0
职业发展
4.0
管理层
2.5
45%
推荐率
优点
Amazing career growth opportunities
Chill management at some locations
Work-life balance valued in certain roles
缺点
Toxic workplace culture
Codependent atmosphere
Confusing interview process
薪资范围
20,304个数据点
Junior/L3
VP
Junior/L3 · Data Scientist Analyst
0份报告
$146,500
年薪总额
基本工资
-
股票
-
奖金
-
$124,525
$168,475
面试评价
4条评价
难度
3.5
/ 5
时长
21-35周
体验
正面 0%
中性 75%
负面 25%
面试流程
1
Application Review
2
HR Screen/HireVue
3
Recruiter Screen
4
Superday/Panel Interview
5
Final Decision
常见问题
Behavioral/STAR
Technical Knowledge
Culture Fit
Past Experience
Case Study
最新动态
Aidoc Raises $150 Million Series E Led by Goldman Sachs to Scale Clinical AI for Earlier, Safer Diagnoses - Yahoo Finance UK
Yahoo Finance UK
News
·
1w ago
Goldman Sachs and Bain Lead Investment in AI Marketing Startup - WSJ
WSJ
News
·
1w ago
Goldman Staff in Hong Kong Lose Access to Anthropic’s Claude - Bloomberg.com
Bloomberg.com
News
·
1w ago
Goldman cuts access to Anthropic's Claude for Hong Kong bankers, source says - Yahoo Finance
Yahoo Finance
News
·
1w ago