
Senior Site Reliability Engineer
About the role
Job Summary
The Senior Support Engineer in Site Reliability engineering (SRE) will be responsible for ensuring the reliability and availability of the systems and applications. The role involves proactive monitoring, troubleshooting, and resolution of issues to maintain optimal performance and uptime.
Key Responsibilities
-
Collaborate within team to identify and address critical system issues.
-
Implement automation tools and processes to improve system reliability and efficiency.
-
Perform regular system audits to ensure compliance with security standards and best practices.
-
Respond to and resolve escalated technical support issues in a timely manner.
-
Develop and maintain documentation related to system configurations, processes, and procedures.
Skill Requirements
-
Proficiency in programming languages such as python, java, or go.
-
Handson experience with cloud platforms like aws, azure, or google cloud.
-
Strong knowledge of containerization tools like docker and orchestration tools like kubernetes.
-
Familiarity with monitoring and logging tools such as prometheus, grafana, elk stack.
-
Good problem-solving skills and the ability to work under pressure in a fast paced environment.
Other Requirements
1.Relevant certifications in cloud platforms (AWS Certified DevOps Engineer, Azure DevOps Engineer Expert) would be a plus.
Required skills
Python
Java
Go
AWS
Azure
Kubernetes
Prometheus
Grafana
About HCL Technologies
Bengaluru
Headquarters