
Senior Site Reliability Engineer Lead
About the role
Job Summary
The Senior Support Lead in Site Reliability engineering (SRE) will be responsible for overseeing the support and reliability operations within the organization. This role will focus on ensuring the stability, performance, and efficiency of the systems while leading a team of support engineers to provide exceptional service.
Key Responsibilities
-
Lead and manage a team of support engineers in resolving incidents, requests, and problems to ensure system uptime and reliability.
-
Collaborate with the engineering and development teams to implement efficient and scalable solutions that enhance system performance.
-
Develop and maintain support documentation, standard operating procedures, and best practices for the support team.
-
Identify opportunities for automation and implement tools to streamline support processes.
-
Monitor system performance and provide recommendations for improvements to optimize system reliability.
-
Participate in on call rotations to address critical incidents and ensure 24/7 system availability.
-
Conduct regular performance evaluations, provide feedback, and mentor team members to promote professional growth.
Skill Requirements
Detailed JD is as below Should have more than 7 or more years of IT experience.
Should be well versed with Site reliability Engineering and ITIL concept.
Automation & DevOps Tools
Ansible (Playbooks)
JenkinsXLR (or similar orchestration tools)
AI/Automation tools (preferred)Version Control GitHub / Bitbucket
Monitoring & Observability
Splunk Dynatrace, Open Telemetry (OTel)
Programming & Scripting
Python Shell Scripting:
**Experience with event-driven systems and streaming platforms is good to have.Understanding of ITIL / Incident / Change Management processes. Should have good analytical skill Team player, ready to work in 16*7 rotational shifts. *
Other Requirements
1.Relevant certifications in Site Reliability Engineering (SRE) or Cloud Services are a plus.
Benefits and perks
•Learning Budget
Required skills
SRE
ITIL
Automation
Monitoring
Incident management
DevOps
About HCL Technologies
Pune
Headquarters