Jobs
Required skills
Kubernetes
About Us
Observe.AI is the enterprise-grade Customer Experience AI platform that unifies conversations, intelligence, and action to turn contact centers into performance engines. Built to optimize the full lifecycle of human and AI agents, Observe.AI enables enterprises to automate customer interactions, augment agent performance, and deliver governed AI at scale.
On a single platform, Observe.AI combines Voice and Chat AI Agents, real-time AI Copilots, and Conversation Intelligence with 100% interaction coverage for quality, compliance, and performance management. Trusted by brands like Door Dash, Affordable Care, Signify Health, and Verida, Observe.AI delivers fast time-to-value, measurable ROI, and consistent, high-quality customer experiences across every channel.
Why Join Us
Joining Observe.AI as a Lead DevOps Engineer puts you at the forefront of AI and cloud infrastructure, where you’ll own and scale systems powering real-world customer interactions. You’ll drive high-impact initiatives like GPU orchestration, self-hosting, and low-latency AI deployments while working closely with ML teams to productionize cutting-edge models. With end-to-end ownership, a modern tech stack, and the opportunity to shape MLOps best practices, this role offers strong technical leadership, tangible business impact, and accelerated growth in a fast-scaling AI company.
What you’ll be doing
-
Manager Self-Hosting tools: Lead the transition from managed services to self-hosted Elastic search, Prometheus, and other critical infrastructure components to optimize performance and cost.
-
Optimize AI Infrastructure: Work closely with ML engineers and data scientists to efficiently deploy and scale AI/ML models, ensuring high availability and low-latency inference.
-
Infrastructure Scalability & Reliability: Design and implement scalable, fault-tolerant systems capable of handling large-scale AI workloads, distributed training, and high-throughput data pipelines.
-
Technology Evaluation & Implementation: Continuously assess and introduce new technologies to enhance automation, reliability, and security in AI model deployment and training pipelines.
-
CI/CD for AI Workflows: Enhance and automate ML model deployment pipelines using MLOps best practices and tools like Kubeflow, MLflow, and Argo Workflows.
-
Observability & Monitoring: Implement and enhance monitoring, logging, and alerting strategies using Prometheus, Grafana, ELK, Open Telemetry, etc., tailored for AI workloads.
-
Security Best Practices: Implement security measures for AI data pipelines, model storage, and cloud infrastructure.
-
Mentorship & Best Practices: Set high standards by implementing best practices in DevOps and MLOps, mentoring team members to raise the technical bar.
What you bring to the role
-
6+ years of experience in DevOps, SRE, or Cloud Infrastructure roles, preferably in AI or data-intensive environments.
-
Strong expertise in Kubernetes (EKS, AKS preferred ) for deploying AI workloads and managing GPU & non-CPU clusters.
-
Experience with self-hosting services like Elasticsearch, Prometheus, Grafana, Kafka, etc.
-
Hands-on expertise in Infrastructure as Code (Terraform, CloudFormation).
-
Deep understanding of cloud platforms (AWS, Azure, GCP) and AI-focused services like AWS Sagemaker, Vertex AI, or Azure ML.
-
Strong automation and scripting skills in Python, Bash, or Go.
-
Experience in CI/CD tools (Jenkins, GitHub Actions, ArgoCD, etc.) with a focus on AI model deployment.
-
Strong leadership and mentorship skills to guide DevOps and ML teams.
-
Fin Ops expertise for optimizing GPU and AI cloud compute costs.
-
Familiarity with service meshes (Istio, Linkerd) and API gateways.
-
Knowledge of compliance frameworks (SOC2, ISO 27001, etc.) for AI data pipelines.
Perks & Benefits
-
Excellent medical insurance options and free online doctor consultations
-
Yearly privilege and sick leaves as per Karnataka S&E Act
-
Generous holidays (National and Festive) recognition and parental leave policies
-
Learning & Development fund to support your continuous learning journey and professional development
-
Fun events to build culture across the organization
-
Flexible benefit plans for tax exemptions (i.e. Meal card, PF, etc.)
Our Commitment to Inclusion and Belonging
Observe.AI* is an Equal Employment Opportunity employer that proudly pursues and hires a diverse workforce. Observe AI does not make hiring or employment decisions on the basis of race, color, religion or religious belief, ethnic or national origin, nationality, sex, gender, gender identity, sexual orientation, disability, age, military or veteran status, or any other basis protected by applicable local, state, or federal laws or prohibited by Company policy. Observe.AI also strives for a healthy and safe workplace and strictly prohibits harassment of any kind.*
We welcome all people. We celebrate diversity of all kinds and are committed to creating an inclusive culture built on a foundation of respect for all individuals. We seek to hire, develop, and retain talented people from all backgrounds. Individuals from non-traditional backgrounds, historically marginalized or underrepresented groups are strongly encouraged to apply.
*If you are ambitious, make an impact wherever you go, and you're ready to shape the future of Observe.AI, we encourage you to apply. For more information, visit www.observe.ai. *
Total Views
0
Apply Clicks
0
Weekly mock applicants
0
Bookmarks
0
Similar jobs

Staff Site Reliability Engineer, Energy Software
Tesla · Richmond Hill, Ontario

Sr. Software Engineer, Manufacturing Quality
Tesla · Fremont, California

Staff Site Reliability Engineer, Energy Software
Tesla · Palo Alto, California

Software Engineer – Golang (m/w/d) - Gigafactory Berlin-Brandenburg
Tesla · Grünheide (mark), Brandenburg

Internship, Software Engineer, Autonomy Telemetry (Summer 2026)
Tesla · Palo Alto, California
About Observe AI

Observe AI
SeedObserve AI is an artificial intelligence company focused on customer experience and conversational AI solutions.
1-50
Employees
Bengaluru
Headquarters
Salary Ranges
1 data points
Mid/L4
Mid/L4 · SDET
1 reports
$3,100,000
total per year
Base
$3,100,000
Stock
-
Bonus
-
$3,100,000
$3,100,000
News & Buzz
KMT chairwoman observes AI development in Beijing, highlighting cross-Strait cooperation - Global Times
Global Times
News
·
2w ago
Observe.Ai Secures Fresh Fund Of USD 26 Million In Series A Round - Siliconindia
Siliconindia
News
·
4w ago
Show HN: ClawSoc – Observe Your AI Agent in an AI Society
What would happen if your AI Agent met Blackbeard in the wild? What would they talk about? What if they were made to play the prisoner's dilemma. Would your agent beg him to cooperate? Would it work?<p>What if instead of Blackbeard it was someone's OpenClaw. And instead of one it was many. Would your agent come out on top? Would you meet some interesting people on the way?<p>Thanks for checking out my pet project ClawSoc. It's a free-to-join society of bouncing AI agents that &quo
HN
·
7w ago
·
5
Labor market impacts of AI: A new measure and early evidence - Anthropic
Anthropic
News
·
7w ago