Oracle

Cloud applications and platform services.

Principal Site Reliability Engineer

Role: DevOps
Experience: Staff+
Location: Mexico, United States
Work arrangement: In-office
Employment type: Full-time
Posted: 1 month ago

As a Principal member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

This is a hybrid role, not a fully remote one. It requires working in the office at least 3 days a week in Guadalajara.

  • Career Level: IC4

What You’ll Do:

  • Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
  • Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
  • Design and implement high-availability architectures and standards for large-scale distributed systems.
  • Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
  • Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
  • Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
  • Guide technical decision-making and mentor junior SREs and developers across teams.
  • Participate in and lead postmortems, root cause analysis, and preventative design changes.
  • Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
  • Participate in a rotational on-call schedule to ensure the health and availability of production services.

What We’re Looking For:

  • Advanced experience with Linux systems administration
  • Strong programming skills in Python (with automation libraries)
  • Advanced Bash/Shell scripting
  • Deep understanding of distributed systems, networking, and service architecture
  • Solid knowledge of databases and how they behave in production (SQL or NoSQL)
  • Strong understanding of CI/CD pipelines, Agile methodologies, and DevOps best practices
  • Experience writing and maintaining unit tests and production-grade software
  • Proven ability to lead cross-functional efforts and technical problem-solving in live environments

Nice to Have:

  • Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)
  • Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)
  • Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)

About Oracle:

Cloud applications and platform services.

  • Company type: Public
  • Employees: 140,000+
  • Headquarters: Austin
  • Valuation: $300B

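Reviews:

  • Overall rating: 3.5 (10 reviews)
  • Work-life balance: 2.8
  • Compensation: 4.0
  • Culture: 3.2
  • Career: 2.5
  • Management: 2.3
  • Would recommend to a friend: 62%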

Pros:

  • Good compensation and benefits
  • Supportive team culture and colleagues
  • Flexible work arrangements

Cons:

  • Poor management and leadership
  • Work-life balance challenges
  • Limited career advancement opportunities

Salary Information:

31,728 data points

Principal/L7 · Senior Principal Consultant (1,776 reports)

  • Total compensation: $205,852
  • Base salary: $181,648
  • Stock: -
  • Bonus: $24,204

$157,007

$275,085

Interview Reviews:

  • Reviews: 8
  • Difficulty: 3.1 / 5
  • Duration: 14-28 weeks
  • Experience: Positive 0%, Neutral 75%, Negative 25%

Interview Process:

  1. Application Review
  2. Recruiter Screen
  3. Technical Phone Screen
  4. Final Interview
  5. Offer Decision

Frequently Asked Questions:

  • Coding/Algorithm
  • Technical Knowledge
  • Behavioral/STAR
  • Past Experience