
Track Lead (Support & Operations)
About the role
Job Summary
The Site Reliability Engineer (SRE) ensures the availability, performance, and reliability of cloud‑native SaaS platforms running on Microsoft Azure. The role focuses on observability, incident management, and automation, with Dynatrace as the primary monitoring and AIOps platform.
Key Responsibilities
Operate and support production workloads on Azure (AKS, Azure PaaS services)
Implement and manage end‑to‑end observability using Dynatrace (metrics, logs, traces, baselines, alerts)
Define and track SLIs, SLOs, and error budgets to drive reliability decisions
Lead or participate in incident response, root cause analysis, and blameless postmortems
Automate operational tasks using Infrastructure as Code (Terraform, ARM/Bicep, Helm)
Collaborate with DevOps and engineering teams to improve deployment safety, resilience, and performance
Support on‑call rotations and ensure continuous service improvement
Skill Requirements
Strong experience with Microsoft Azure, especially AKS and cloud networking
Hands‑on expertise with Dynatrace (services, PurePaths, dashboards, alerting)
Experience operating Kubernetes‑based production systems
Solid understanding of SRE principles (reliability, toil reduction, automation)
Experience with incident management and troubleshooting distributed systems
Required skills
Operations
Technical Support
Team Leadership
About HCL Technologies
Gautam Buddha Nagar
Headquarters