
Track Lead - Ansible, Terraform, GITHub
About the role
Job Summary
Dynatrace Expert
Responsibilities of Role:
- Deploy, configure, and manage Dynatrace platform for full-stack monitoring (infrastructure, Application ,end user experience)
- Configure application performance monitoring (APM) and distributed tracing
- Set up real-time monitoring dashboards and service flow visualization.
- Define and manage alerting policies using Dynatrace AI (Davis) for anomaly detection.
- Integrate Dynatrace with CI/CD pipelines and ITSM tools (Service Now).
- Monitor microservices, containers, and cloud-native workloads
- Perform root cause analysis using Smart scape topology and AI engine
- Automate deployments using APIs, scripts, and Infrastructure-as-Code tools.
- Configure SLOs, SLIs, and synthetic monitoring for user journeys.
- Support and train teams on Dynatrace usage and best practices Terraform Expert Responsibilities of Role:
- Design and implement Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments.
- Develop reusable Terraform modules for scalable infrastructure deployments.
- Manage provisioning of cloud resources across AWS, Azure, and GCP.
- Implement Terraform state management (remote backends like S3, Azure Storage).
- Integrate Terraform with CI/CD pipelines for automated deployments.
- Ensure infrastructure versioning and change tracking using Git.
- Implement security best practices and compliance policies in Terraform code.
- Perform infrastructure cost optimization and resource lifecycle management.
- Troubleshoot and resolve infrastructure provisioning issues.
- Collaborate with DevOps and Cloud teams for automation initiatives.
- Automate infrastructure provisioning using Terraform and orchestration tools.
- Maintain proper documentation and governance of IaC standards.
Ansible Expert Responsibilities of Role:
- Design, develop, and maintai
PRTG Expert Responsibilities of Role:
-
Install, configure, and manage PRTG Network Monitor environments.
-
Monitor network devices, servers, applications, and bandwidth usage.
-
Configure sensors (SNMP, WMI, HTTP, Flow, etc.) for various monitoring needs.
-
Design dashboards and maps for real-time infrastructure visibility.
-
Configure alerts, notifications, and escalation policies.
-
Optimize performance and scalability of PRTG monitoring systems.
-
Perform root cause analysis and incident troubleshooting.
-
Integrate PRTG with third-party systems and APIs.
-
Automate monitoring setup and maintenance tasks.
-
Manage distributed monitoring using remote probes.
-
Conduct regular health checks and system upgrades.
Nagios Expert Responsibilities of Role:
-
Install, configure, and maintain Nagios Core / Nagios XI monitoring systems.
-
Monitor infrastructure, applications, servers, and network devices.
-
Configure hosts, services, plugins, and custom checks.
-
Develop and manage alerting and notification mechanisms.
-
Create dashboards and reports for system health monitoring.
-
Integrate Nagios with third-party tools and scripts.
-
Develop custom plugins using Bash, Python, or Perl.
-
Perform root cause analysis for detected issues.
-
Ensure high availability and performance of monitoring systems.
-
Automate monitoring configurations and maintenance tasks.
-
Upgrade and patch Nagios environments
Key Responsibilities
PowerShell Expert - Responsibilities of Role:
-
Develop, maintain, and optimize PowerShell scripts for automation of IT operations and administrative tasks.
-
Automate infrastructure management, system configuration, and deployment processes.
-
Create scripts for user management, system monitoring, and application maintenance.
-
Integrate PowerShell automation with cloud platforms (Azure, AWS).
-
Develop reusable script modules and maintain script repositories.
-
Automate patch management, backups, and system health checks.
-
Troubleshoot and debug scripts to resolve operational issues.
-
Integrate PowerShell with APIs, REST services, and third-party tools.
-
Work with DevOps teams to embed automation into CI/CD pipelines.
-
Manage Windows Server environments using PowerShell and DSC (Desired State Configuration).
-
Implement security best practices including credential management and secure scripting.
-
Document automation processes and provide training to teams
Ansible Expert Responsibilities of Role:
-
Design, develop, and maintain Ansible playbooks, roles, and collections for automated provisioning and configuration.
-
Implement Infrastructure as Code (IaC) practices using Ansible in hybrid cloud or on-premises environments.
-
Automate repetitive tasks, reducing manual errors and deployment times across development, staging, and production systems.
-
Manage patching, system updates, and application deployments using Ansible automation.
-
Collaborate with development, security, and operations teams to implement consistent configuration standards.
-
Integrate Ansible with CI/CD pipelines (GitLabCI, Jenkins, Azure DevOps, etc.) for automated deployment workflows.
-
Maintain and optimize Ansible Tower / AWX for centralized job execution and inventory management.
-
Document infrastructure automation processes and provide training to internal teams.
Terraform Expert Responsibilities of Role:
-
Design and implement Infrastructure as Code (IaC) using Terraform for cloud and on-prem environments.
-
Develop reusable Terraform modules for scalable infrastructure deployments.
-
Manage provisioning of cloud resources across AWS, Azure, and GCP.
-
Implement Terraform state management (remote backends like S3, Azure Storage).
-
Integrate Terraform with CI/CD pipelines for automated deployments.
-
Ensure infrastructure versioning and change tracking using Git.
-
Implement security best practices and compliance policies in Terraform code.
-
Perform infrastructure cost optimization and resource lifecycle management.
-
Troubleshoot and resolve infrastructure provisioning issues.
-
Collaborate with DevOps and Cloud teams for automation initiatives.
-
Automate infrastructure provisioning using Terraform and orchestration tools.
-
Maintain proper documentation and governance of IaC standards.
Grafana Expert Responsibilities of Role:
- Design and maintain scalable Grafana dashboards for real-time monitoring of
infrastructure , applications and business KPIs
- Integrate Grafana with a wide range of data-producing systems , including infrastructure
components , application telemetry sources and cloud service monitoring endpoints to
- enable unified and real time visualization of operational metrics
Skill Requirements
Dynatrace Expert Expertise:
-
Monitoring: Full-stack observability, real user monitoring (RUM), synthetic monitoring
-
Cloud Platforms: AWS, Azure, GCP (integration with Dynatrace)
-
Container Monitoring: Kubernetes, Open Shift, Docker
Terraform Expert Expertise:
-
IaCTools: Terraform (Core, Cloud), Terragrunt
-
Cloud Platforms: AWS, Azure, GCP
-
Configuration Mgmt: Ansible (good to have)
Ansible Expert Expertise:
1.Ansible Tools: Ansible Core, Ansible Galaxy,Ansible Tower / AWX, Ansible Vault
2.Automation & CI/CD: Jenkins, GitLabCI, GitHub Actions, Azure DevOps
3.Configuration Management: Puppet (basic), Chef (basic), Salt Stack(basic)
PowerShell Expert Expertise
-
Scripting & Automation: PowerShell (advanced scripting, modules, functions)
-
Windows Administration: Active Directory, Windows Server, Exchange
-
Automation Frameworks: PowerShell DSC, Azure Automation
Other Requirements
PRTG Expert Expertise:
-
Monitoring Tools: PRTG Network Monitor
-
Protocols: SNMP, WMI, Net Flow, s Flow, HTTP/HTTPS
Nagios Expert Expertise
-
Monitoring Tools: Nagios Core, Nagios XI
-
Plugins: NRPE, NCPA, custom plugin development
Grafana Expert Expertise
1.Visualization Tools: Grafana, Kibana
2.Metrics & Monitoring: Prometheus, InfluxDB, Graphite, Telegraf, CollectD
PRTG Expert – 12+ years of IT experience with 3+ years in PRTG
PowerShell Expert - 12+ years of IT experience with 4+ years in PowerShell scripting and automation
Ansible -12+ years experience
Nagios -12+ years of IT experience with 3+ years in Nagios
Grafana Expert -12+ years experience
Dynatrace Expert - 12+ years of overall IT experience with 4+ years in Dynatrace
**Terraform -**12+ years of IT experience with 4+ years in Terraform
Required skills
Ansible
Terraform
GitHub
DevOps
automation
About HCL Technologies
Chennai
Headquarters