Jobs
Required skills
HPC
Systems design
Performance tuning
Security
Observability
Product thinking
Engineering thinking
Roadmapping
Prioritization
Stakeholder Management
Communication
Career Category
Engineering
Job Description
Position Overview
The GCF5 Site Reliability Engineer is the senior technical leader for the HPC Enablement pillar. They define and socialize operational standards and patterns, lead multi-team delivery, mentor GCF4 engineers, and translate researcher needs into scalable compute enablement designs. They own pillar-level reliability, performance, cost efficiency, and SLA/SLO outcomes, and influence cross-team engineering quality.
This role reports to the GCF7 leader and partners closely with peer GCF5 domain leads across SCIP to ensure cohesive, scalable platform evolution.
Core Responsibilities
- Own the compute reliability and enablement roadmap within SCIP.
- Define onboarding playbooks and golden paths for HPC workloads.
- Establish containerization and reproducible runtime standards.
- Optimize scheduler configuration and resource allocation policies.
- Conduct workload profiling and performance tuning.
- Define and manage SLOs, reliability standards, and operational guardrails.
- Lead incident response and reduce recurring failures.
- Mentor engineers and elevate reliability practices.
- Partner with scientific teams to translate compute requirements into scalable infrastructure patterns.
Core Competencies
- Deep expertise in HPC Enablement (HPC) with evidence of standard‑setting and reuse.
- Systems design at scale (HPC); performance, security, and observability fundamentals.
- Product/engineering thinking: road mapping, prioritization, and outcome‑oriented delivery.
- Stakeholder influence across science, engineering, and governance forums; crisp written/verbal communication.
Core Success Measures
- HPC job success rate improvement.
- Reduction in MTTR for compute incidents.
- Performance improvements relative to baseline.
- Time-to-onboard new scientific workloads.
- Improvement in cost-per-compute-hour efficiency.
- Reduction in operational toil via automation.
Key Relationships
- Collaborates with GCF6 Group Lead and cross‑functional leaders (R&D/PD/Dev).
- Mentors and develops GCF4 Data and Software Engineers, partners with platform, data, ML, and research teams.
- Interfaces with governance (architecture, security, compliance) and vendor/partner teams.
Decision Authority
- Approve designs within the pillar; define and waive standards/patterns with rationale.
- Recommend buy‑vs‑build; commit pillar resources to meet SLAs/SLOs; escalate risks.
- Prioritize pillar backlog and roadmap in alignment with strategy and OKRs.
Qualifications
Basic Qualifications:
- BS+8 / MS+6 / PhD in CS/Engineering/Data disciplines.
- Demonstrated production delivery experience in HPC at scale.
- Demonstrated literacy in a relevant scientific domain (e.g., biology, chemistry, therapeutic discovery).
Preferred Qualifications:
- Depth in HPC Enablement (HPC).
- Kubernetes and continuous integration/continuous delivery (CI/CD) at scale; observability, performance tuning, and security-by-design.
- Evidence of standard‑setting and cross‑team influence; mentoring experience.
.
Total Views
1
Apply Clicks
0
Weekly mock applicants
0
Bookmarks
0
Similar jobs

Senior Engineer - Monitoring Tools, Event Monitoring
HCL Technologies · Hyderabad, India

Senior Staff AI/ML Scale Engineer
Marvell · 2 Locations

Sr.FinOps Analyst- AR
Amazon · Hyderabad, TS, IND
Senior Engineer- Software QA Devops (Jenkins & Groovy Scripting)
Silicon Labs · Hyderabad

Senior Team Lead - Automation
Uber · Hyderabad, India
About Amgen

Amgen
PublicA biotechnology company that develops and manufactures human therapeutics for various illnesses and diseases.
10,001+
Employees
Thousand Oaks
Headquarters
$138B
Valuation
Reviews
3.6
10 reviews
Work-life balance
3.2
Compensation
4.1
Culture
3.4
Career
2.8
Management
3.5
65%
Recommend to a friend
Pros
Excellent benefits and health benefits
Good pay and compensation
Supportive management and strong leadership
Cons
Limited career growth and promotion opportunities
Work-life balance challenges and long hours
Bureaucratic processes
Salary Ranges
1,244 data points
L2
L3
L4
L5
L6
L2 · Financial Analyst L2
0 reports
$94,068
total per year
Base
$37,627
Stock
$47,034
Bonus
$9,407
$65,848
$122,288
Interview experience
5 interviews
Difficulty
3.0
/ 5
Duration
14-28 weeks
Offer rate
40%
Experience
Positive 20%
Neutral 80%
Negative 0%
Interview process
1
Application Review
2
HR Screen
3
Hiring Manager Interview
4
Technical/Role-Specific Interview
5
Panel Interview
6
Offer
Common questions
Technical Knowledge
Behavioral/STAR
Past Experience
Data Analysis/Statistics
Culture Fit
News & Buzz
Amgen (AMGN) Laps the Stock Market: Here's Why - Yahoo Finance Singapore
Yahoo Finance Singapore
News
·
3d ago
UBS Sees Continued Upside in Amgen (AMGN), Lifts Target to $400 - Insider Monkey
Insider Monkey
News
·
3d ago
Amgen Inc. $AMGN Shares Sold by Whittier Trust Co. - MarketBeat
MarketBeat
News
·
4d ago
AE Wealth Management LLC Has $43.90 Million Holdings in Amgen Inc. $AMGN - MarketBeat
MarketBeat
News
·
4d ago