
Tower Lead - Azure DevOps, Terraform
About the role
Job Summary
The Tower Lead for Support & Operations plays a critical role in managing escalations, ensuring operational excellence, and driving customer satisfaction. This position is responsible for overseeing the implementation of organizational initiatives, optimizing service delivery according to established agreements, and fostering a culture of continuous improvement.
requirements for a Site Reliability Engineering (SRE) role, emphasizing over 7 years of IT experience with expertise in Java, Spring Boot, microservices, and hands-on experience with New Relic for performance monitoring
Key Responsibilities
- Drive Revenue Generation By Implementing Infrastructure As Code (Iac) Strategies Using Terraform And Azure, Ensuring Optimal Resource Allocation And Utilization.
- Manage Escalations By Leveraging Terraform And Azure Kubernetes, Ensuring Timely Resolution Of Incidents While Adhering To Established Service Level Agreements (Slas).
- Ensure Operational Hygiene By Validating Reports Generated Through Azure Monitoring Tools, Ensuring Service Delivery Aligns With The Statement Of Work (Sow).
- Enhance Customer Satisfaction By Developing And Implementing New Frameworks Using Azure Paas Services To Address Client Needs And Feedback Effectively.
- Lead Profit Improvement Plans (Pips) By Integrating Automation Solutions In Azure And Terraform, Identifying Self-Driven Initiatives That Contribute To Operational Efficiency.
requirements for a Site Reliability Engineering (SRE) role, emphasizing over 7 years of IT experience with expertise in Java, Spring Boot, microservices, and hands-on experience with New Relic for performance monitoring
Skill Requirements
understanding SRE principles, DevOps practices, and experience with high-availability large-scale e Commerce platforms, along with operational knowledge of Angular applications and NoSQL databases such as CouchDB or Couchbase
- Proficient In Infrastructure As Code (Iac) Using Terraform And Azure.
- Strong Understanding Of Windows Azure Iaas And Paas Services.
- Familiarity With Azure Kubernetes For Container Orchestration.
- Excellent Problem-Solving Skills With A Focus On Customer Satisfaction And Operational Efficiency.
- Strong Analytical Skills To Validate And Interpret Operational Reports.
requirements for a Site Reliability Engineering (SRE) role, emphasizing over 7 years of IT experience with expertise in Java, Spring Boot, microservices, and hands-on experience with New Relic for performance monitoring
Other Requirements
- Optional But Valuable: Azure Solutions Architect Expert Certification.
- Optional But Valuable: Terraform Associate Certification
Key skills include incident management, root cause analysis, production troubleshooting, Linux, networking fundamentals, and application runtime diagnostics
Preferred qualifications include experience with cloud platforms (Azure, AWS, GCP), containerization and orchestration tools like Docker and Kubernetes, CI/CD automation, performance testing tools like JMeter, and chaos engineering concepts
Domain experience in retail or e Commerce, covering order management, payments, loyalty, promotions, customer identity and profile services, and handling high-traffic seasonal sale events is also favored
Required skills
Azure
Terraform
Azure DevOps
Kubernetes
Monitoring
Incident management
SRE
Automation
About HCL Technologies
Gautam Buddha Nagar
Headquarters