Senior Lead Site Reliability Engineer

JPMorgan Chase

Plano, TX, United States · On-site · Full-time · Posted 2w ago

As a Senior Lead Site Reliability Engineer at JPMorgan Chase within the Consumer and Community Banking team, you will set clear quality gates across requirements, design, secure coding, testing, releases, and post-production monitoring to ensure reliability, performance, security, and observability.

Job responsibilities

  • Set clear quality gates across requirements, design, secure coding, testing, releases, and post-production monitoring to ensure reliability, performance, security, and observability.
  • Turn business goals into clear, testable requirements, and hold teams to an objective “Definition of Done” before release.
  • Define and manage SLIs/SLOs and error budgets, and ensure they’re reflected in roadmaps and delivery plans.
  • Lead operational readiness reviews, assess delivery risk, and drive fixes through root-cause analysis, corrective actions, and automation to prevent repeat issues.
  • Improve logging, monitoring, and alerting so dashboards are actionable and alerts are tuned to reduce noise and speed response.
  • Own CI/CD controls (security, reliability, testing, change management) and drive automation to reduce toil and increase release confidence.
  • Lead and participate in major incident response (including outside business hours when needed), run post-incident reviews, and drive improvements against KPIs like availability, MTTR, and change failure rate.

Required qualifications, capabilities, and skills

  • 10+ years supporting critical applications in large-scale environments, including experience leading and mentoring engineers/teams.

  • Strong SDLC and secure development practices, with experience implementing objective quality gates and release readiness standards.

  • Hands-on SRE experience, including SLIs/SLOs, error budgets, incident management, and post-incident reviews/root-cause analysis.

  • Experience designing actionable monitoring/logging and dashboards (e.g., Splunk, AppDynamics, or equivalent), including alert tuning.

  • Experience with CI/CD pipelines and automated testing (unit, integration, security), plus operational controls that reduce change risk.

  • Calm, accountable incident leadership under pressure, with strong communication and stakeholder management.

  • Comfortable collaborating with global teams and engaging during critical incidents outside standard business hours.

Preferred qualifications, capabilities, and skills

  • Proficiency in Python; experience with LangChain, LangGraph, or similar agentic frameworks.
  • Experience implementing LLMs using vector databases and Retrieval-Augmented Generation (RAG), as well as model tuning.
  • Strong SRE fundamentals: SLOs, SLIs, error budgets, blameless post-mortems, capacity planning.
  • Hands-on with observability tooling (Datadog, Prometheus, OpenTelemetry, distributed tracing).
  • Experience leading operational readiness reviews and maintaining “Definition of Done” checklists (SLO monitoring, runbooks, rollback validation, resilience/failover testing, vulnerability remediation, audit/control artifacts).
  • Deep public cloud expertise (AWS or equivalent), including infrastructure automation (Terraform/Terraform Enterprise, CloudFormation), capacity planning, and resilience patterns for distributed systems.
  • Track record of improving reliability outcomes (higher availability, lower MTTR, lower change failure rate) through automation and observability.
  • Splunk Administrator certification (or equivalent).
  • Familiarity with containers and orchestration (Docker, Kubernetes) and modern production operations practices.

About JPMorgan Chase

JPMorgan Chase & Co. is an American multinational banking institution headquartered in New York City and incorporated in Delaware. It is the largest bank in the United States, and the world's largest bank by market capitalization as of 2025.

Employees: 300,000+

Headquarters: New York City

Valuation: $500B

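Reviews

Overall rating: 3.8 (10 reviews)

Work-life balance: 3.2

Compensation: 4.1

Company culture: 3.8

Career growth: 3.0

Management: 2.5

Would recommend to a friend: 65%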

Pros

Good benefits and compensation

Supportive and collaborative environment

Flexible work arrangements

Cons

Long hours and heavy workload

Management issues and lack of direction

High stress during peak times

Salary range

41 data points

Junior/L3 · Analytics Solutions Associate (1 report)

Total annual compensation: $139,000

Base salary: $107,000

Stock: -

Bonus: -

Interview experience

5 interviews

Difficulty: 3.0 / 5

Duration: 14-28 weeks

Offer rate: 40%

Experience: 20% positive, 80% neutral, 0% negative

Interview process

  1. Application Review
  2. HireVue Video Interview
  3. Recruiter Screen
  4. Superday/Panel Interview
  5. Final Interview
  6. Offer

Common questions

Behavioral/STAR

Technical Knowledge

Culture Fit

Past Experience

Case Study