Benefits & Perks
• Learning Budget
• Learning
Required Skills
Ansible
Terraform
Python
Go
Bash
PowerShell
Kubernetes
Docker
AWS
Azure
GCP
Due to project requirements, candidates must be Singaporean citizens or already hold Singaporean Permanent Residency (PR) at the time of application.
As a Service Reliability Engineer (SRE) in the DAMO service line, you will take a multifaceted approach to ensuring technical excellence and operational efficiency within the infrastructure domain. Specializing in reliability, resilience and system performance, you will take a lead role in championing the principles of Site Reliability Engineering. By strategically integrating automation, monitoring and incident response, you will facilitate the evolution from traditional operations to a more customer-focused and agile approach. Emphasizing shared responsibility and a commitment to continuous improvement, you will cultivate a collaborative culture, enabling organizations to meet and exceed their reliability and business objectives.
Job responsibilities
- You will conduct SRE and Disaster Recovery (DR) maturity assessments.
- You will engineer automation solutions using Ansible to replace manual workflows.
- You will own and manage the current manual Disaster Recovery process/pipeline.
- You will improve site reliability through mechanisms and architectures that enhance fault tolerance and reduce MTTR/MTTD.
- You will drive the integration of observability automation into the CI/CD pipeline.
- You will handle production incidents, lead client communication, and create root cause analysis documentation.
- You will monitor performance of production systems and improve scaling to meet SLA and SLO targets.
- You will work closely with application development teams to advise on and implement reliability improvements.
- You will improve system observability across logging, metrics and alerting, reducing false alarms to eliminate unnecessary toil and improve overall process efficiency.
- You will implement chaos engineering practices to regularly validate system reliability.
- You have a clear understanding of client goals and business needs, setting direction for site reliability in alignment with business expectations, including high availability targets such as 99.999% with minimal or no disruption where required.
Job qualifications
Technical Skills
- You have expertise in Ansible orchestration including advanced strategies, failure logic handling, and Jinja2 templating.
- You have the ability to integrate Terraform with Ansible for seamless provisioning-to-configuration workflows.
- You have hands-on experience with Python, Go, Bash or PowerShell scripting.
- You have working knowledge of at least one public cloud (AWS/Azure/GCP).
- You have experience with observability tools (Grafana, Datadog, New Relic, ELK, Dynatrace, etc.) and can use data for RCA.
- You have familiarity with DevOps, SRE and GitOps concepts and practices.
- You have knowledge of container technologies and orchestration (Kubernetes, EKS, Docker Swarm, Nomad, etc.).
- You have an understanding of modern architecture (microservices, serverless, NoSQL, REST APIs) and experience debugging and building metrics/dashboards.
- You have experience designing infrastructure aligned with Cloud Well-Architected principles (reliability, security, cost, performance, operations).
Professional Skills
- You are able to mentor team members through workshops and knowledge enablement.
- You are able to create comprehensive documentation and runbooks.
- You have strong communication and articulation skills in English.
- You have strong collaboration and negotiation skills with client and cross-functional teams.
- You have a resilient problem-solving mindset and don’t give up easily when debugging issues.
- You can remain calm and composed during high-pressure production incidents.
- You can recommend improvements backed by strong technical reasoning.
- You can understand both business and technical requirements and break them down into deliverables.
- You have strong ownership and willingness to take responsibility beyond strict role boundaries.
- You are willing to participate in rotation-based or need-based 24x7 availability support.
Other things to know
Learning & Development
There is no one-size-fits-all career path at Thoughtworks: however you want to develop your career is entirely up to you. But we also balance autonomy with the strength of our cultivation culture. This means your career is supported by interactive tools, numerous development programs and teammates who want to help you grow. We see value in helping each other be our best and that extends to empowering our employees in their career journeys.
About Thoughtworks
Thoughtworks is a dynamic and inclusive community of bright and supportive colleagues who are revolutionizing tech. As a leading technology consultancy, we’re pushing boundaries through our purposeful and impactful work. For 30+ years, we’ve delivered extraordinary impact together with our clients by helping them solve complex business problems with technology as the differentiator. Bring your brilliant expertise and commitment for continuous learning to Thoughtworks. Together, let’s be extraordinary.
About DAMO
At DAMO™ Managed Services, we go beyond routine maintenance - we focus on continuous evolution to help organizations achieve extraordinary impact. Here, you’ll work on proactive improvements rather than reactive fixes. We're at the forefront of cost optimization, automation, and scalable solutions. Your expertise will play a key role in streamlining operations, boosting efficiency, and ensuring our systems grow with our clients’ needs. Join and be part of a team that thrives on curiosity, innovation, and purpose.
See our AI policy.
About Thoughtworks
Thoughtworks Holding, Inc. is a privately held, global technology company with 49 offices in 18 countries. It provides software design and delivery, and tools and consulting services.
Employees: 10,001+
Headquarters: Chicago