热门公司

UiPath
UiPath

Principal Site Reliability Engineer

职能DevOps
级别Staff+
地点Bangalore - Engineering
方式现场办公
类型全职
发布1个月前
立即申请

LIFE AT UIPATH:

The people at Ui Path believe in the transformative power of automation to change how the world works. We’re committed to creating category-leading enterprise software that unleashes that power.

To make that happen, we need people who are curious, self-propelled, generous, and genuine. People who love being part of a fast-moving, fast-thinking growth company. And people who care—about each other, about Ui Path, and about our larger purpose.

Could that be you?

YOUR MISSION

Ui Path is seeking a Principal Site Reliability Engineer to redefine how reliability is engineered using AI. This role focuses on building intelligent reliability platforms and tooling that leverage AI/ML to improve reliability of our services, reduce operational toil for developers, and accelerate incident response across large-scale, cloud-native systems.

You will operate at the intersection of SRE, distributed systems, and applied AI, designing systems that transform raw telemetry into actionable insights, enable predictive reliability, and introduce self-healing capabilities into production environments.

You will build the next generation of reliability systems, where detection, diagnosis, and remediation are increasingly automated and data driven.

You will help define how reliability is architected, scaled, measured, and automated across our large-scale, cloud-native systems. This role requires broad technical judgment, platform thinking, and the ability to influence reliability outcomes across the various engineering and platform teams.

WHAT YOU'LL DO AT UIPATH:

  • Intelligent automation and Self-healing systems

  • Design and implement self-healing mechanisms including automated remediation workflows and intelligent retry and fallback strategies.

  • Reliability platform tooling

  • Build internal systems that enable engineering teams to debug faster using AI-assisted tooling and proactively identify and mitigate reliability risks.

  • End-to-End Reliability strategy

  • Define and evolve reliability strategy using predictive reliability models(Capacity, Failure forecasting, Reliability scoring) and embed intelligent reliability practices across the engineering teams.

AI-assisted Incident response & RCA - Build AI-powered systems that determine impact and use historical data to improve detection and response over time.

  • Technical Leadership & Org Impact
  • Influence standards for building AI-driven tooling, mentor junior and senior engineers, and elevate reliability focus across the organization.

WHAT YOU'LL BRING TO THE TEAM:

Engineering & Reliability Experience:

  • 7+ years of experience in SRE, Platform, Cloud infrastructure engineering roles with a track record of building internal tooling to improve reliability.

  • Strong conceptual understanding of distributed systems, performance bottlenecks, failure modes, and trade-offs inherent to large-scale systems.

AI/ML Application to systems & operations

  • Experience building applications or internal tools using LLMs to automate non-trivial workflows (e.g., AIOps, Automated code reviews, Automated flagging of reliability risks)

  • Hands-on experience with building Agents/Copilots using modern ML frameworks (Py Torch, vLLM or equivalent) in production setting.

Scripting & Tooling:

  • Proficiency in at least one programming language (e.g., Python, Go, or similar). Experience with Infrastructure as Code (e.g., Terraform, Pulumi) and container orchestration (e.g., Kubernetes).

Cloud & Infrastructure Expertise:

  • Hands-on experience working with one or more major cloud providers (Azure, AWS, GCP), with practical knowledge of networking, deployments, and scaling.

Observability & Operational Practices:

  • Proven experience with monitoring/observability stacks (metrics, logs, traces) and building meaningful dashboards and alerts that improve reliability signals.

Incident Response & Post-Incident Learning:

  • Experience participating in and improving incident response, blameless postmortems, and implementing systemic fixes rather than symptomatic patches.

Collaboration & Influence:

  • Ability to partner with product, infrastructure, and engineering teams to influence architecture and reliability practices without direct authority.

Maybe you don’t tick all the boxes above—but still think you’d be great for the job? Go ahead, apply anyway. Please. Because we know that experience comes in all shapes and sizes—and passion can’t be learned.

Many of our roles allow for flexibility in when and where work gets done. Depending on the needs of the business and the role, the number of hybrid, office-based, and remote workers will vary from team to team. Applications are assessed on a rolling basis and there is no fixed deadline for this requisition. The application window may change depending on the volume of applications received or may close immediately if a qualified candidate is selected.

We value a range of diverse backgrounds, experiences and ideas. We pride ourselves on our diversity and inclusive workplace that provides equal opportunities to all persons regardless of age, race, color, religion, sex, sexual orientation, gender identity, and expression, national origin, disability, neurodiversity, military and/or veteran status, or any other protected classes. Additionally, UiPath provides reasonable accommodations for candidates on request and respects applicants' privacy rights. To review these and other legal disclosures, visit our privacy policy https://www.uipath.com/legal/trust-and-security/privacy-policy.

浏览量

0

申请点击

0

Mock Apply

0

收藏

0

关于UiPath

UiPath

UiPath

Public

UiPath Inc. is a global software company that develops artificial intelligence (AI) and agentic automation and orchestration software. The company's software enables the building and orchestration of AI agents to automate complex processes and workflows.

5,001-10,000

员工数

New York

总部位置

$13B

企业估值

评价

10条评价

3.8

10条评价

工作生活平衡

3.2

薪酬

4.0

企业文化

4.1

职业发展

3.5

管理层

2.8

72%

推荐率

优点

Supportive team and colleagues

Good benefits and competitive salary

Innovative projects and cutting-edge technology

缺点

Poor management and lack of direction

High pressure and overwhelming workload

Limited career advancement opportunities

薪资范围

19个数据点

Mid/L4

Senior/L5

Mid/L4 · Customer Success Manager

1份报告

$172,500

年薪总额

基本工资

$150,000

股票

-

奖金

-

$172,500

$172,500