채용
Responsibilities We are looking for 12+ years of experience in IT operations management with the key responsibilities below.
-
Key Responsibilities Problem Management
-
Own and manage the end‑to‑end lifecycle of Problems, including identification, prioritization, investigation, root cause analysis (RCA), and closure.
-
Facilitate major problem lookbacks following Sev1 and Sev2 incidents to identify systemic issues and prevent recurrence.
-
Partner with engineering and platform teams to drive permanent corrective actions and track remediation progress.
-
Analyze incident and problem trends to identify patterns, risk areas, and improvement opportunities.
-
Ensure Problem records meet enterprise quality standards, including documentation, RCA completeness, and action tracking.
-
Major Incident Management
-
Act as a Major Incident Manager during high‑severity incidents, providing leadership, structure, and coordination across technical and business stakeholders.
-
Ensure timely escalation, stakeholder communication, and executive visibility during major incidents.
-
Guide teams through incident response processes to minimize business impact and mean time to restore (MTTR).
-
Support post‑incident reviews and ensure lessons learned are incorporated into operational practices.
-
Contribute to the continuous improvement of incident response models, severity definitions, and escalation processes.
-
Availability Management
-
Monitor and manage the availability of critical applications and services, ensuring alignment with defined SLOs and availability targets.
-
Produce and review availability, outage, and MTTR metrics, translating operational data into meaningful insights for technology and business leaders.
-
Drive improvements in monitoring, alerting, and observability to enable proactive detection of service degradation. ________________________________________
Technical & Process Skills
- Deep working knowledge of Incident, Major Incident, Problem, Change, and Availability Management aligned to ITIL practices
- Proven capability to lead Sev1 / Sev2 incidents end‑to‑end
- Strong understanding of severity definitions, escalation triggers, and IC engagement expectations
- Familiarity with SLO concepts and service reliability expectations Professional Competencies Communication & Influence
- Communicates complex technical issues in clear, business‑relevant language Incident Leadership & Decision‑Making
- Maintains composure and clarity under pressure during high‑impact incidents Accountability & Ownership
- Follows incidents and problems through to sustainable resolution Collaboration & Partnership
- Works effectively across global teams and time zones Continuous Improvement Mindset
- Actively identifies opportunities to improve.
Certifications ITIL - Preferred Qualifications We are looking for 12+ years of experience in IT operations management with the key responsibilities stated in JD.
We are looking for 12+ years of experience in IT operations management with the key responsibilities below.
-
Key Responsibilities Problem Management
-
Own and manage the end‑to‑end lifecycle of Problems, including identification, prioritization, investigation, root cause analysis (RCA), and closure.
-
Facilitate major problem lookbacks following Sev1 and Sev2 incidents to identify systemic issues and prevent recurrence.
-
Partner with engineering and platform teams to drive permanent corrective actions and track remediation progress.
-
Analyze incident and problem trends to identify patterns, risk areas, and improvement opportunities.
-
Ensure Problem records meet enterprise quality standards, including documentation, RCA completeness, and action tracking.
-
Major Incident Management
-
Act as a Major Incident Manager during high‑severity incidents, providing leadership, structure, and coordination across technical and business stakeholders.
-
Ensure timely escalation, stakeholder communication, and executive visibility during major incidents.
-
Guide teams through incident response processes to minimize business impact and mean time to restore (MTTR).
-
Support post‑incident reviews and ensure lessons learned are incorporated into operational practices.
-
Contribute to the continuous improvement of incident response models, severity definitions, and escalation processes.
-
Availability Management
-
Monitor and manage the availability of critical applications and services, ensuring alignment with defined SLOs and availability targets.
-
Produce and review availability, outage, and MTTR metrics, translating operational data into meaningful insights for technology and business leaders.
-
Drive improvements in monitoring, alerting, and observability to enable proactive detection of service degradation. ________________________________________
Technical & Process Skills
- Deep working knowledge of Incident, Major Incident, Problem, Change, and Availability Management aligned to ITIL practices
- Proven capability to lead Sev1 / Sev2 incidents end‑to‑end
- Strong understanding of severity definitions, escalation triggers, and IC engagement expectations
- Familiarity with SLO concepts and service reliability expectations Professional Competencies Communication & Influence
- Communicates complex technical issues in clear, business‑relevant language Incident Leadership & Decision‑Making
- Maintains composure and clarity under pressure during high‑impact incidents Accountability & Ownership
- Follows incidents and problems through to sustainable resolution Collaboration & Partnership
- Works effectively across global teams and time zones Continuous Improvement Mindset
- Actively identifies opportunities to improve.
Certifications ITIL - Preferred:
We are looking for 12+ years of experience in IT operations management with the key responsibilities stated in JD.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Senior Linux & Cloud Administrator - AWS (8 to 14 years of experience)
SAP ·

Senior Functional Consultant - Workday CLM Evisort
Workday · 133 Locations

Territory Manager, NSE West Japan
Stryker · 3 Locations

Consulting - Technology Transformation - Senior - Hong Kong
EY ·

Senior Security Assurance Specialist
Vanguard · 4 Locations
About Principal

Principal
PublicPrincipal is a global investment management and insurance company providing retirement services, asset management, and insurance solutions. The company serves millions of customers worldwide through employer-sponsored retirement plans and individual investment products.
10,001+
Employees
Harrogate
Headquarters
Reviews
3.1
6 reviews
Work Life Balance
2.0
Compensation
4.0
Culture
1.8
Career
2.0
Management
1.5
25%
Recommend to a Friend
Pros
Good pay and benefits
Supportive assistant management
Quality software/codebase
Cons
Poor management and leadership
Micromanagement
Limited career advancement
Salary Ranges
2 data points
L6/Principal
Principal/L7
L6/Principal · Staff/Principal Backend SWE
1 reports
$710,000
total / year
Base
$270,000
Stock
$440,000
Bonus
-
$710,000
$710,000
Interview Experience
9 interviews
Difficulty
3.4
/ 5
Duration
14-28 weeks
Offer Rate
22%
Experience
Positive 11%
Neutral 33%
Negative 56%
Interview Process
1
Application Review
2
Recruiter Screen
3
Phone Screen
4
Technical Interview
5
System Design Interview
6
Behavioral Interview
7
Final Round
Common Questions
Technical Knowledge
System Design
Behavioral/STAR
Past Experience
Culture Fit
News & Buzz
Principal Financial Group’s 2026 Outlook Takes Center Stage Ahead of Earnings - AD HOC NEWS
Source: AD HOC NEWS
News
·
5w ago
Principal Real Estate Income Fund Announces Notification of Sources of Distribution - Business Wire
Source: Business Wire
News
·
5w ago
News | Lee & Associates of Michigan announces new principal of firm - CoStar
Source: CoStar
News
·
5w ago
AE Wealth Management LLC Boosts Stock Position in Principal Financial Group, Inc. $PFG - MarketBeat
Source: MarketBeat
News
·
5w ago