refresh

トレンド企業

Trending

採用

JobsAmgen

Site Reliability Engineer III

Amgen

Site Reliability Engineer III

Amgen

India - Hyderabad

·

On-site

·

Full-time

·

1w ago

Required Skills

HPC

Systems design

Performance tuning

Security

Observability

Product thinking

Engineering thinking

Roadmapping

Prioritization

Stakeholder management

Communication

Career Category

Engineering

Job Description

Position Overview

The GCF5 Site Reliability Engineer is the senior technical leader for the HPC Enablement pillar. They define and socialize operational standards and patterns, lead multi-team delivery, mentor GCF4 engineers, and translate researcher needs into scalable compute enablement designs. They own pillar-level reliability, performance, cost efficiency, and SLA/SLO outcomes, and influence cross-team engineering quality.

This role reports to the GCF7 leader and partners closely with peer GCF5 domain leads across SCIP to ensure cohesive, scalable platform evolution.

Core Responsibilities

  • Own the compute reliability and enablement roadmap within SCIP.
  • Define onboarding playbooks and golden paths for HPC workloads.
  • Establish containerization and reproducible runtime standards.
  • Optimize scheduler configuration and resource allocation policies.
  • Conduct workload profiling and performance tuning.
  • Define and manage SLOs, reliability standards, and operational guardrails.
  • Lead incident response and reduce recurring failures.
  • Mentor engineers and elevate reliability practices.
  • Partner with scientific teams to translate compute requirements into scalable infrastructure patterns.

Core Competencies

  • Deep expertise in HPC Enablement (HPC) with evidence of standard‑setting and reuse.
  • Systems design at scale (HPC); performance, security, and observability fundamentals.
  • Product/engineering thinking: road mapping, prioritization, and outcome‑oriented delivery.
  • Stakeholder influence across science, engineering, and governance forums; crisp written/verbal communication.

Core Success Measures

  • HPC job success rate improvement.
  • Reduction in MTTR for compute incidents.
  • Performance improvements relative to baseline.
  • Time-to-onboard new scientific workloads.
  • Improvement in cost-per-compute-hour efficiency.
  • Reduction in operational toil via automation.

Key Relationships

  • Collaborates with GCF6 Group Lead and cross‑functional leaders (R&D/PD/Dev).
  • Mentors and develops GCF4 Data and Software Engineers, partners with platform, data, ML, and research teams.
  • Interfaces with governance (architecture, security, compliance) and vendor/partner teams.

Decision Authority

  • Approve designs within the pillar; define and waive standards/patterns with rationale.
  • Recommend buy‑vs‑build; commit pillar resources to meet SLAs/SLOs; escalate risks.
  • Prioritize pillar backlog and roadmap in alignment with strategy and OKRs.

Qualifications

Basic Qualifications:

  • BS+8 / MS+6 / PhD in CS/Engineering/Data disciplines.
  • Demonstrated production delivery experience in HPC at scale.
  • Demonstrated literacy in a relevant scientific domain (e.g., biology, chemistry, therapeutic discovery).

Preferred Qualifications:

  • Depth in HPC Enablement (HPC).
  • Kubernetes and continuous integration/continuous delivery (CI/CD) at scale; observability, performance tuning, and security-by-design.
  • Evidence of standard‑setting and cross‑team influence; mentoring experience.

.

Total Views

0

Apply Clicks

0

Mock Applicants

0

Scraps

0

About Amgen

Amgen

A biotechnology company that develops and manufactures human therapeutics for various illnesses and diseases.

10,001+

Employees

Thousand Oaks

Headquarters

$138B

Valuation

Reviews

3.8

2 reviews

Work Life Balance

2.5

Compensation

3.0

Culture

3.0

Career

4.0

Management

3.0

70%

Recommend to a Friend

Pros

Professional development opportunities

Exposure to diverse functions and projects

Large-scale project experience

Cons

Understaffed with high output expectations

Limited permanent job opportunities

Temporary contract limitations

Salary Ranges

1,544 data points

L2

L3

L4

L5

L6

L2 · Financial Analyst L2

0 reports

$94,068

total / year

Base

$37,627

Stock

$47,034

Bonus

$9,407

$65,848

$122,288

Interview Experience

3 interviews

Difficulty

2.7

/ 5

Duration

14-28 weeks

Experience

Positive 0%

Neutral 33%

Negative 67%

Interview Process

1

Application Review

2

Recruiter Screen

3

Technical Phone Screen

4

Onsite/Virtual Interviews

5

Final Round Interview

6

Offer

Common Questions

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

System Design

Past Experience