
Engineer (Tools & Automation)
About the role
Job Summary
To develop and deliver code for assigned work in accordance with time, quality, and cost standards.
- Platform Management: Oversee APM (Application Performance Monitoring) and infrastructure monitoring stacks.
- Incident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis.
- Team Leadership: Mentor engineers and manage global shift rotations for continuous monitoring.
- Service Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards.
Key Responsibilities
- Develop code for the project as assigned.
- Maintain the existing project and resolve issues that arise in it.
- Work on new requests raised by the client and develop the requested features.
- Write and maintain documentation.
Skill Requirements
Other Requirements
Required skills
APM
Infrastructure monitoring
Incident management
RCA
Alerting
Problem management
About HCL Technologies