トレンド企業

Oracle
Oracle

Cloud applications and platform services.

Principal Site Reliability Engineer

職種DevOps
経験Staff+
勤務地Mexico, United States
勤務オンサイト
雇用正社員
掲載1ヶ月前
応募する

As a Principal member of the Site Reliability Engineering (SRE) team, you'll take ownership of highly available systems, influence service design, and work across teams to drive resiliency, automation, and operational excellence. This is a hands-on engineering role where deep infrastructure knowledge meets software engineering expertise, ideal for experienced SREs ready to take the lead.

This is not a fully remote role but a hybrid role. Does require in office at least 3 days a week in Guadalajara.

  • Career Level
  • IC4

What You’ll Do:

  • Lead the design, automation, and support of OCI services with a focus on resiliency, security, scalability, and performance.
  • Own and improve the end-to-end reliability metrics (SLOs, SLAs, KPIs) for your services.
  • Design and implement high-availability architectures and standards for large-scale distributed systems.
  • Serve as the ultimate escalation point for complex operational issues, using a deep understanding of service topologies and interdependencies.
  • Architect and build automation and orchestration tools that reduce manual work and prevent problem recurrence.
  • Collaborate with development teams to improve service designs, optimize deployments, and implement best practices for operational efficiency.
  • Guide technical decision-making and mentor junior SREs and developers across teams.
  • Participate in and lead postmortems, root cause analysis, and preventative design changes.
  • Contribute to capacity planning, demand forecasting, and long-term service scalability strategies.
  • Participate in a rotational on-call schedule to ensure the health and availability of production services.

What We’re Looking For:

  • Advanced experience with Linux systems administration
  • This is not a fully remote role but a hybrid role. Does require in office at least 3 days a week in Guadalajara.
  • Strong programming skills in Python (with automation libraries)
  • Advanced Bash/Shell scripting
  • Deep understanding of distributed systems, networking, and service architecture
  • Solid knowledge of databases and how they behave in production (SQL or NoSQL)
  • Strong understanding of CI/CD pipelines, Agile methodologies, and DevOps best practices
  • Experience writing and maintaining unit tests and production-grade software
  • Proven ability to lead cross-functional efforts and technical problem-solving in live environments

Nice to Have:

  • Hands-on experience with monitoring and observability tools (Grafana, Prometheus, New Relic, etc.)
  • Familiarity with Oracle Cloud Infrastructure (OCI) or other cloud platforms (AWS, Azure, GCP)
  • Experience with Infrastructure-as-Code (Terraform, Ansible) and container orchestration (Kubernetes)

閲覧数

0

応募クリック

0

Mock Apply

0

スクラップ

0

Oracleについて

Oracle

Oracle

Public

Cloud applications and platform services.

140,000+

従業員数

Austin

本社所在地

$300B

企業価値

レビュー

10件のレビュー

3.5

10件のレビュー

ワークライフバランス

2.8

報酬

4.0

企業文化

3.2

キャリア

2.5

経営陣

2.3

62%

知人への推奨率

良い点

Good compensation and benefits

Supportive team culture and colleagues

Flexible work arrangements

改善点

Poor management and leadership

Work-life balance challenges

Limited career advancement opportunities

給与レンジ

31,728件のデータ

Principal/L7

Principal/L7 · Senior Principal Consultant

1,776件のレポート

$205,852

年収総額

基本給

$181,648

ストック

-

ボーナス

$24,204

$157,007

$275,085

面接レビュー

レビュー8件

難易度

3.1

/ 5

期間

14-28週間

体験

ポジティブ 0%

普通 75%

ネガティブ 25%

面接プロセス

1

Application Review

2

Recruiter Screen

3

Technical Phone Screen

4

Final Interview

5

Offer Decision

よくある質問

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

Past Experience