
Azure Infra AI Operations & Platform Engineer
About the role
Role Description :
The AI Operations & Platform Engineer is responsible for designing, building, and operating AI-powered operational solutions that improve infrastructure management, incident response, monitoring, automation, governance, and cloud operations. The role combines cloud engineering, infrastructure operations, automation, software development, and AI technologies to deliver intelligent operational capabilities across Azure and hybrid environments.
Responsibilities: 1.
AI Operations Engineering:
-
Build AI agents for incident investigation, root cause analysis, monitoring, alert triage, and operational automation.
-
Develop agentic workflows using Lang Graph, Lang Chain, Semantic Kernel, or equivalent.
2. Cloud & Infrastructure Engineering
-
Design, deploy, and maintain Azure infrastructure solutions.
-
Support hybrid cloud environments spanning Azure and on-premises infrastructure.
-
Implement Infrastructure-as-Code using Terraform or Bicep.
3. Platform Engineering & Automation
-
Develop automation solutions for provisioning, deployment, compliance, governance, and operational management.
-
Build self-service infrastructure and platform capabilities.
4. Observability & Operational Intelligence
-
Implement monitoring, logging, tracing, and observability solutions.
-
Build operational dashboards and automated investigation capabilities.
5. DevOps & Delivery Enablement
-
Design and maintain Azure DevOps pipelines and deployment frameworks.
-
Support Git-based development practices and CI/CD.
6. Governance & Security
-
Implement Azure Policies, Defender for Cloud controls, and governance frameworks.
-
Ensure AI solutions align with enterprise security and operational requirements.
Preferred Skills:
- Strong Azure infrastructure experience
- Azure networking, identity, security, monitoring, and governance
- Azure DevOps and CI/CD pipelines
- Power
Shell and Python:
- Bicep
- Understanding of AI, LLMs, AI Agents, and automation
- API integration and cloud services
- Strong troubleshooting and root cause analysis skills
Mandatory skills:
-
Lang Graph, Lang Chain, Azure AI Foundry, Azure OpenAI
-
Vector databases and RAG architectures
-
Open Telemetry
-
AKS and Container Apps
Preferred skill distribution:
- 40% Infrastructure & Cloud Engineering
- 20% Dev
Ops & Platform Engineering:
- 20% AI & Agent Development
- 20% Automation & Software Development
Educational qualification:
- BE, BTech, BCA, BSc (IT) MCA, MBA (IT) and MSc(IT)
Experience :
- Total 7+ years of experience in Azure Infra and AI Automation.
- 5+ years in Azure Infrastructure, Cloud Operations, Platform Engineering, or DevOps.
- 2+ years in automation and software development.
- Experience building operational tools, dashboards, or automation platforms.
- Exposure to AI, LLMs, AI agents, or AI-powered operational use cases.
About Bosch
bangalore
Headquarters