Jobs
Job Posting Title:
Principal Site Reliability Engineer:
Req ID:
10147292
Job Description:
“We Power the Magic!” That’s our motto at Disney Experiences (DX). Our team creates world-class immersive digital experiences for the Company’s premier vacation brands including Disney’s Parks & Resorts worldwide, Disney Cruise Line, Aulani, a Disney Resort & Spa, and Disney Vacation Club.
We are responsible for the end-to-end digital and physical Guest experience for all technology & digital-led initiatives across the Attractions & Entertainment, Food & Beverage, Resorts & Transportation and Merchandise lines of business as well as other initiatives including My Disney Experience and Hey, Disney!
This role sits in the US Parks & Resorts and Experiences organization within Disney Experience Technology. It works closely with other SREs, application delivery teams, and systems engineers from across the company.
About The Role & Team:
The US Parks Site Reliability Organization is responsible for the operation of an assigned portfolio of applications and services. We partner with other SRE and Tech Ops organizations to set standards for DevOps and SRE. We monitor the reliability of our assigned portfolio and work with teams to make changes that strengthen and harden services. We use and create tools and automations to reduce the toil of our technology organization to increase efficiency and create value by keeping the organization’s focus on delivering world-class experiences.
What You'll Do:
- Lead and impact the SRE culture, mentoring others on the effective use of that culture to drive observability and reduce toil
- Advocate for accelerated adoption of service level management, including advancing the adoption and tracking of SLIs, SLOs, and SLAs for all systems and applications in the assigned portfolio
- Elevate and lead the design, build, and support of products and platforms by serving as a thought leader who considers which green field products should be used and evaluating build vs. buy decisions
- Lead the adoption of AI/LLM-assisted reliability engineering by building and governing secure, production-grade workflows while also architecting and operating AI-enabled reliability capabilities
- Propel and drive development pipelines, automate infrastructure and operations, create telemetry for monitoring, engineer high reliability and reinforce best practices to secure company data
- Establish systems administration requirements in Linux and Windows platforms and bring knowledge on systems, network, operational excellence and application stability, security, performance, and capacity management, operational excellence and application stability, security, performance, and capacity management, as well as documentation
- Engage in estimation and planning across the organization, voicing recommendations and solutions from a technical perspective
- Work with on-call SEs and SREs to ensure an effective response to Major Incidents, minimize Mean Time to Resolve, and provide comprehensive MI retrospectives that result in measurable improvements to prevent future failures.
Required Qualifications:
- Minimum 10 years of related work experience
- Demonstrated experience in advancing the maturity of Site Reliability Engineering in an enterprise scale environment.
- Expertise in defining and implementing industry-leading observability strategies across diverse and highly complex distributed systems, ensuring optimal performance and reliability
- Comprehensive knowledge and hands-on experience with a comprehensive DevOps toolset for source control management, continuous integration/continuous deployment, orchestration, containerization, application performance management, observability, and reliability testing
- Demonstrated experience in engineering cloud-agnostic solutions using multiple Cloud service providers including AWS, Azure, and GCP
- Demonstrated experience evaluating and applying multiple GPT model families across cost/latency/quality tradeoffs, and engineering scalable MCP toolchains plus high-signal RAG systems to improve operational outcomes
- Mastery in architecting and managing highly available, scalable, and automated infrastructure using configuration management and orchestration tools such as Terraform, Cloud Formation, Ansible, and Chef, contributing to the organization's strategic objectives and competitive advantage
- Expert in using AI to optimize system reliability through advanced analytics and prescriptive recommendations
- An advocate for a diverse and inclusive culture that encourages innovation and ensures every team member feels a sense of belonging
Preferred Qualifications:
-
Extensive Experience with high demand releases and brands
-
PCI audit and standards experience
Required Education:
- Bachelor’s degree in Computer Science, Information Systems, Software, Electrical or Electronics Engineering, or comparable field of study, and/or equivalent work experience
Job Posting Segment:
DX Technology
Job Posting Primary Business:
Tech Delivery, Platforms, & Core Systems
Primary Job Posting Category:
Site/System Reliability Engineer
Employment Type:
Full time
Primary City, State, Region, Postal Code:
Bay Lake, FL, USA
Alternate City, State, Region, Postal Code:
Date Posted:
2026-04-09
Total Views
0
Apply Clicks
0
Weekly mock applicants
0
Bookmarks
0
Similar jobs

Senior DevOps Engineer (Cortex Research) Tel Aviv, Tel Aviv 02/08/2026
Palo Alto Networks · tel aviv

Senior Platform Engineer, Workday Financials
Capital One · 8 Locations

SENIOR NETWORK/CLOUD OPERATIONS ENGINEER
Wipro · New Jersey, United States

Senior Machine Learning Platform Engineer (Platform)
Coinbase · Remote - USA

Staff Software Engineer, Tech Lead - Mobile DevOps
Toast · Remote, US
About Walt Disney

Walt Disney
PublicThe Walt Disney Company (TWDC), commonly known as simply Disney, is an American multinational mass media and entertainment conglomerate headquartered at the Walt Disney Studios complex in Burbank, California.
10,001+
Employees
Burbank
Headquarters
$201B
Valuation
Reviews
3.8
10 reviews
Work-life balance
2.8
Compensation
3.2
Culture
3.5
Career
3.6
Management
2.9
72%
Recommend to a friend
Pros
Amazing culture and benefits
Great brand recognition and networking opportunities
Innovative projects and talented colleagues
Cons
Work-life balance challenges and long hours
High expectations and pressure to perform
Pay and compensation could be better
Salary Ranges
59 data points
Junior/L3
Principal/L7
Staff/L6
Junior/L3 · Decision Science Consultant
1 reports
$121,849
total per year
Base
$93,730
Stock
-
Bonus
-
$121,849
$121,849
Interview experience
4 interviews
Difficulty
3.3
/ 5
Duration
21-35 weeks
Offer rate
50%
Experience
Positive 0%
Neutral 75%
Negative 25%
Interview process
1
Application Review
2
HR Screen
3
Hiring Manager Interview
4
Panel Interview
5
Offer
Common questions
Behavioral/STAR
Culture Fit
Past Experience
Technical Knowledge
News & Buzz
Walt Disney Layoffs and Adtech Shake-Up Fuel Bullish Bets - TipRanks
TipRanks
News
·
3d ago
'Destination: Aulani, A Disney Resort & Spa' brings island vibe with some Disney magic - ABC11 Raleigh-Durham
ABC11 Raleigh-Durham
News
·
3d ago
Disney Legend Hayley Mills Is Glowing at 79 as She Recalls Meeting Walt Disney for the First Time - Parade
Parade
News
·
3d ago
The 20 Greatest Disney Animated Movie Masterpieces of All Time, Ranked - Collider
Collider
News
·
3d ago