HCL Technologies

Track Manager - Windows Azure IaaS, Terraform

Role: Infrastructure
Level: Manager
Location: India
Work: On-site
Type: Full-time
Posted: 1 day ago

About the role

Job Summary

Capacity & Availability Management
· Identify scaling opportunities for virtual machines or services as required, and identify zone redundancy patterns for performance.
· Keep track of capacity forecasts and proactively identify performance bottlenecks.

Backup & Restore Operations
· Execute frequent backups (Azure Backup, NetApp Snapshots) and perform basic restore tasks to ensure business continuity.
· Conduct routine backup verifications/tests to confirm data integrity.

Access & Permissions Management
· Maintain Azure/NetApp file shares, setting up and adjusting access controls and AD group permissions according to organizational policy.
· Perform periodic identity and access reviews to enforce the principle of least privilege.

Primary Monitoring & Incident Response
· Provide 24×7 monitoring of Azure infrastructure (compute, network, storage) using tools such as Azure Monitor, Splunk, Dynatrace, and custom dashboards.
· Respond to alerts and triage P1/P2 escalations via ServiceNow war rooms, performing initial diagnosis and remediation where possible.
· Adhere to incident, change, and exception processes.

• Sufficient knowledge to follow runbooks and standard operating procedures (SOPs).
• Keep documentation of SOPs and IaC changes continuously updated in a central repository (e.g., Git repos).
• Familiarity with Epic implementations (on-prem / cloud).

Skill Requirements

Logging & Metrics Oversight
· Oversee monitoring agents (e.g., Splunk, Dynatrace, Azure Alerts, System Pulse), ensuring they are up to date and generating the right alerts/metrics for L2 to act upon.
· Collaborate with L3 to fine-tune alert thresholds and logging when chronic issues emerge.

Basic Performance Testing
· Execute routine performance checks (e.g., load or stress tests) in coordination with L3 teams when potential service degradation is suspected.
· Document and escalate consistent performance anomalies.

Other Requirements

SKILL SET & STAFFING CONSIDERATIONS

Mandatory skill set required:
  • Splunk, Dynatrace
  • Familiarity with Azure Backup services, basic restore procedures, and file share permissions.
  • Comfortable reading and troubleshooting logs/metrics (Azure Monitor).
  • Proficiency in ticketing systems (ServiceNow) and in collaborating with other technical teams for escalations.

About HCL Technologies

Others

Headquarters