
Global payments and technology company
Lead, Site Reliability Engineer (Infrastructure operations) at Mastercard
About the role
Our Purpose
Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Lead, Site Reliability Engineer (Infrastructure operations)
Lead SRE Engineer, Site Reliability Engineering
Our Purpose:
Mastercard powers economies and empowers people across more than 200 countries and territories worldwide.
We are committed to building an inclusive, digital economy that benefits everyone, everywhere—by making transactions safe, simple, smart, and accessible. Through secure data, trusted networks, strong partnerships, and relentless innovation, we help individuals, financial institutions, governments, and businesses unlock their greatest potential.
About the Role:
Mastercard’s Program-aligned Site Reliability Engineering (SRE) teams are dedicated to delivering a seamless experience for our customers. We achieve this by maintaining every aspect of our Programs’ infrastructure and technology ecosystem to the highest standards, ensuring compliance with rigorous security requirements.
Within Mastercard, SRE focuses on the reliability and performance of core infrastructure, networks, and foundational services that power our applications. Our mission is to ensure these components operate with excellence, enabling applications to deliver an outstanding customer experience.
In this role, you will join our Payments Network SRE team and take ownership of continuously assessing and elevating the end-to-end service quality of our platform. You will leverage data to drive root cause analysis and deliver strategic insights to key stakeholders on resource utilization, capacity forecasting, and performance trends, ensuring the availability, scalability, and resilience of our network.
Key Responsibilities:
Lead continuous assessments of the application infrastructure supporting critical Mastercard applications, focusing on health, performance, monitoring and alerting, and capacity analysis. Collaborate with Product and Development teams to forecast growth requirements and ensure scalability and resiliency.
Champion observability as a core principle for infrastructure services by assessing environments and technologies to uncover gaps in monitoring and alerting. Design and implement strategies to close these gaps, ensuring all infrastructure telemetry is integrated into a unified, single-pane-of-glass view. Build custom dashboards to investigate and perform root cause analysis on complex issues.
Lead regular incident reviews with internal support teams to ensure root causes are identified. When patterns of failure or compatibility issues between software and infrastructure emerge, develop and implement strategies to remediate or mitigate risks.
Leverage automation and AI technologies to enhance proactive issue detection and enable self-healing capabilities, reducing Mean Time to Detect (MTTD) and Mean Time to Mitigate (MTTM).
Develop testing and validation plans for new environment builds, disaster recovery exercises, and post-maintenance activities to certify that each environment is ready before customer traffic is routed to it.
Champion continuous learning, development, and knowledge sharing across networking and other infrastructure disciplines to strengthen multi-disciplinary SRE team capabilities. Lead training initiatives for team members and for Product and Development teams on networking aspects of the platforms.
Evaluate vendor hardware, firmware, and software upgrade roadmaps, and conduct proof-of-concept (POC) testing to identify potential risks and opportunities for improvement in upcoming releases.
All about you:
- 5–10 years of experience in an SRE or SRE-related operations role, including 3+ years supporting e-commerce, financial services, or large-scale SaaS platforms.
- Excellent infrastructure troubleshooting and analytical problem-solving skills.
- Strong hands-on experience with observability and monitoring tools such as Splunk, Dynatrace, or equivalent, with a proven ability to triage and investigate complex issues.
- Familiarity with network telemetry tools such as SolarWinds and NetScout.
- Proficiency in packet-level debugging, including capturing traffic with tools like tcpdump and analyzing packets using Wireshark.
- Broad understanding of end-to-end infrastructure supporting payment platforms, spanning platform services, networking, databases, and storage.
- Experience with automation and Infrastructure as Code tools such as Chef, Ansible, and Terraform, as well as structured data formats (JSON/YAML).
- Excellent communication skills with the ability to coordinate cross-functional troubleshooting efforts and lead RCA processes to closure.
- Demonstrated ability to troubleshoot complex production issues, perform root cause analysis, and drive long-term corrective actions.
- Experience partnering with development teams to shape architecture, define SLIs/SLOs, and embed reliability into services from design through operation.
- Strong understanding of monitoring and observability ecosystems, including Prometheus, Grafana, ELK/EFK, Splunk, Dynatrace, and OpenTelemetry.
- Effective incident management skills with a structured, analytical approach to problem-solving.
The Payments Network SRE team is responsible for the runtime availability of some of Mastercard’s most critical core payment systems, which support national infrastructure and operate 24/7 year‑round. As a result, this role will include periodic on‑call responsibilities when required.
Corporate Security Responsibility:
All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
- Abide by Mastercard’s security policies and practices;
- Ensure the confidentiality and integrity of the information being accessed;
- Report any suspected information security violation or breach; and
- Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines.
Required skills
Site reliability engineering
Infrastructure operations
Systems administration
Monitoring
Incident response
Performance tuning
Security compliance
About Mastercard

Mastercard
Public · A financial network that processes payments between banks and cardholders
Employees: 10,001+
Headquarters: Purchase
Valuation: $360B
Reviews
3.8 (10 reviews)
Work-life balance: 2.8
Compensation: 4.1
Culture: 4.2
Career: 3.4
Management: 3.1
Recommend to a friend: 72%
Pros
Great team culture and supportive colleagues
Excellent benefits and compensation
Training and development opportunities
Cons
Work-life balance challenges and long hours
High pressure and stress during peak times
Management issues and lack of direction
Salary Ranges
51 data points
Junior/L3 · Data Engineer (5 reports)
Total per year: $137,800
Base: $106,000
Stock: -
Bonus: -
$107,900 to $166,918
Interview experience
3 interviews
Difficulty: 3.3 / 5
Duration: 14-28 weeks
Offer rate: 33%
Experience: Positive 33%, Neutral 34%, Negative 33%
Interview process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Behavioral Interview
5. Super Day/Final Round
6. Offer
Common questions
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
System Design
Past Experience