招聘
Replace the first sentence with "As a Senior Lead Site Reliability Engineer at JPMorgan Chase within Consumer and Community banking team, you will set clear quality gates across requirements, design, secure coding, testing, releases, and post-production monitoring to ensure reliability, performance, security, and observability.
Job responsibilities
- Set clear quality gates across requirements, design, secure coding, testing, releases, and post-production monitoring to ensure reliability, performance, security, and observability.
- Turn business goals into clear, testable requirements—and hold teams to an objective “Definition of Done” before release.
- Define and manage SLIs/SLOs and error budgets, and ensure they’re reflected in roadmaps and delivery plans.
- Lead operational readiness reviews, assess delivery risk, and drive fixes through root-cause analysis, corrective actions, and automation to prevent repeat issues.
- Improve logging, monitoring, and alerting so dashboards are actionable and alerts are tuned to reduce noise and speed response.
- Own CI/CD controls (security, reliability, testing, change management) and drive automation to reduce toil and increase release confidence.
- Lead and participate in major incident response (including outside business hours when needed), run post-incident reviews, and drive improvements against KPIs like availability, MTTR, and change failure rate.
Required qualifications, capabilities, and skills
-
10+ years supporting critical applications in large-scale environments, including experience leading and mentoring engineers/teams.
-
Strong SDLC and secure development practices, with experience implementing objective quality gates and release readiness standards.
-
Hands-on SRE experience, including SLIs/SLOs, error budgets, incident management, and post-incident reviews/root-cause analysis.
-
Experience designing actionable monitoring/logging and dashboards (e.g., Splunk, App Dynamics, or equivalent), including alert tuning.
-
Experience with CI/CD pipelines and automated testing (unit, integration, security), plus operational controls that reduce change risk.
-
Calm, accountable incident leadership under pressure, with strong communication and stakeholder management.
-
Comfortable collaborating with global teams and engaging during critical incidents outside standard business hours.
Preferred qualifications, capabilities, and skills
- Proficiency in Python; experience with Lang Chain, Lang Graph, or similar agentic frameworks
- Experience implementing LLMs using vector databases and Retrieval-Augmented Generation (RAG), as well as model tuning
- Strong SRE fundamentals: SLOs, SLIs, error budgets, blameless post-mortems, capacity planning
- Hands-on with observability tooling (Datadog, Prometheus, Open Telemetry, distributed tracing)
- Experience leading operational readiness reviews and maintaining “Definition of Done” checklists (SLO monitoring, runbooks, rollback validation, resilience/failover testing, vulnerability remediation, audit/control artifacts).
- Deep public cloud expertise (AWS or equivalent), including infrastructure automation (Terraform/Terraform Enterprise, CloudFormation), capacity planning, and resilience patterns for distributed systems.
- Track record of improving reliability outcomes (higher availability, lower MTTR, lower change failure rate) through automation and observability.
- Splunk Administrator certification (or equivalent).
- Familiarity with containers and orchestration (Docker, Kubernetes) and modern production operations practices.
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Senior Systems Administrator - MS Exchange
Leidos · Fort Belvoir, VA

Sr Principal Site Reliability Engineer
Walt Disney · San Francisco, CA, USA

Staff Site Reliability Engineer - Apple Ads
Apple · Cupertino, CA

Staff Software Engineer, Tech Lead - Mobile DevOps
Toast · Remote, US

Senior IT Systems Administrator
BAE Systems · Hudson, New Hampshire, United States
关于JPMorgan Chase

JPMorgan Chase
PublicJPMorgan Chase & Co. is an American multinational banking institution headquartered in New York City and incorporated in Delaware. It is the largest bank in the United States, and the world's largest bank by market capitalization as of 2025.
300,000+
员工数
New York City
总部位置
$500B
企业估值
评价
3.8
10条评价
工作生活平衡
3.2
薪酬
4.1
企业文化
3.8
职业发展
3.0
管理层
2.5
65%
推荐给朋友
优点
Good benefits and compensation
Supportive and collaborative environment
Flexible work arrangements
缺点
Long hours and heavy workload
Management issues and lack of direction
High stress during peak times
薪资范围
41个数据点
Junior/L3
Mid/L4
Senior/L5
Junior/L3 · Analytics Solutions Associate
1份报告
$139,000
年薪总额
基本工资
$107,000
股票
-
奖金
-
$139,000
$139,000
面试经验
5次面试
难度
3.0
/ 5
时长
14-28周
录用率
40%
体验
正面 20%
中性 80%
负面 0%
面试流程
1
Application Review
2
HireVue Video Interview
3
Recruiter Screen
4
Superday/Panel Interview
5
Final Interview
6
Offer
常见问题
Behavioral/STAR
Technical Knowledge
Culture Fit
Past Experience
Case Study
新闻动态
Spirepoint Private Client LLC Purchases 3,449 Shares of JPMorgan Chase & Co. $JPM - MarketBeat
MarketBeat
News
·
3d ago
As the world’s largest bank JP Morgan tests Anthropic’s AI tool Mythos, CEO Jamie Dimon admits 'threat'; - The Times of India
The Times of India
News
·
3d ago
Fortifying the enterprise: 10 actions to take now for AI-ready cyber resilience - JPMorganChase
JPMorganChase
News
·
3d ago
JPMorgan Chase & Co. Issues Pessimistic Forecast for Super Micro Computer (NASDAQ:SMCI) Stock Price - MarketBeat
MarketBeat
News
·
4d ago