
Global leader in business and financial data and analytics
Cloud Site Reliability Engineer SRE Data Management Analytics Platform at Bloomberg
About the role
At Bloomberg, data is at the heart of everything we do. As part of the Data Management and Analytics Platform (DMAP) SRE team you will play a critical role in driving analytics throughout the organization to improve our products, better engage with our customers, create greater efficiencies, and unlock new business opportunities through data-driven insights.
Our team is responsible for capturing and processing the who, what, when, where, and why of how clients use Bloomberg products, how our systems perform, and how employees interact with customers. We ingest and prepare massive volumes of data to power reporting, dashboards, self-service tools, and advanced analytics used across the company.
We are looking for a Cloud Site Reliability Engineer (SRE) who is passionate about building and operating highly reliable, scalable data platforms in the cloud. In this role, you will focus on ensuring the availability, performance, and scalability of critical data pipelines and analytics infrastructure. You will work at the intersection of software engineering and infrastructure, applying automation, observability, and reliability best practices to support large-scale distributed systems.
You’ll Be Trusted To:
-
Design, build, and operate highly available, scalable, and resilient cloud infrastructure supporting large-scale data ingestion and analytics platforms
-
Define, implement, and monitor SLIs/SLOs for data systems and services; drive reliability improvements using error budgets and operational metrics
-
Improve observability across data pipelines and platforms through logging, metrics, tracing, and alerting
-
Automate infrastructure provisioning and system management using Infrastructure as Code (IaC)
-
Lead incident response efforts, perform root cause analysis (RCA), and implement post-incident improvements
-
Optimize performance, reliability, and cost efficiency of cloud-based data systems
-
Ensure data platform reliability, including batch and streaming pipelines, storage systems, and reporting infrastructure
-
Partner with data engineers, software engineers, and stakeholders to improve system reliability and operational maturity
-
Strengthen platform security through proactive monitoring, vulnerability management, and cloud security best practices
-
Continuously improve CI/CD pipelines and deployment processes for data infrastructure
You’ll Need To Have:
-
5+ years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure roles
-
Strong proficiency in at least one programming or scripting language (Python, and/or Go)
-
Experience supporting production systems with a focus on reliability, scalability, and observability
-
Hands-on experience operating or designing highly available distributed systems.
-
A Bachelor’s degree in Computer Science, Engineering, Mathematics, or a related field, or equivalent professional experience
We’d Love To See:
-
Experience supporting large-scale data platforms, data pipelines, or analytics infrastructure
-
Strong experience operating production systems in AWS at scale
-
Experience defining and managing SLIs, SLOs, and error budgets
-
Strong background in monitoring and observability tools (e.g., Prometheus, Grafana, CloudWatch, Datadog)
-
Experience leading incident management and conducting postmortems
-
Hands-on experience with Infrastructure as Code (Terraform or CloudFormation)
-
Experience building and maintaining CI/CD pipelines
-
Strong understanding of distributed systems and cloud architecture
-
Experience with containerized workloads (Docker, Kubernetes)
-
Knowledge of AWS services related to data platforms (e.g., S3, EMR, Lambda, Kinesis, Glue, Redshift)
-
Knowledge of Databricks or Snowflake platform
-
Experience with cloud networking concepts (VPCs, routing, security groups)
-
Experience optimizing cloud costs in large-scale environments
-
AWS certification (Associate level or above)
-
A security-first mindset and familiarity with compliance and data governance best practices
-
Experience using operational metrics and data to drive continuous improvement
Our most successful engineers are collaborative, data-driven, and take strong ownership of production systems end-to-end, ensuring the reliability of the data platforms that power Bloomberg’s analytics and insights.
Salary Range = 160,000 - 240,000 USD Annual + Benefits + Bonus
Required skills
SRE
Cloud infrastructure
Observability
Automation
Distributed systems
Reliability engineering
Data platforms
Monitoring
Total Views
0
Total Apply Clicks
0
Total Mock Apply
0
Total Bookmarks
0
More open roles at Bloomberg
Similar jobs

Senior. Principal Platform IAC Engineer
RTX (Raytheon) · US-CO-AURORA-S75 ~ 16800 E Centretech Pkwy ~ BLDG S75

Senior Platform DevOps Engineer (Onsite)
RTX (Raytheon) · US-CO-AURORA-S75 ~ 16800 E Centretech Pkwy ~ BLDG S75

Platform Operations Engineer
RTX (Raytheon) · US-MA-MARLBOROUGH-MA2 ~ 1001 Boston Post Rd ~ BLDG 2

Deployment Specialist
Wipro · Copenhagen, Denmark

Azure Cloud Engineer
Accenture
About Bloomberg

Bloomberg
PublicBloomberg L.P. is an American privately held financial, software, data, and media company headquartered in Midtown Manhattan, New York City. It was co-founded by Michael Bloomberg in 1981, with Thomas Secunda, Duncan MacMillan, Charles Zegar, and a 12% ownership investment by Merrill Lynch.
10,001+
Employees
Midtown Manhattan
Headquarters
Reviews
15 reviews
4.0
15 reviews
Work-life balance
4.2
Compensation
4.5
Culture
3.2
Career
3.0
Management
2.8
65%
Recommend to a friend
Pros
High compensation and competitive total compensation
Good work-life balance
Company stability and job security
Cons
Slow career progression and promotion speed
Management issues and micromanagement
Limited remote work flexibility
Salary Ranges
2,046 data points
L2
L6
Senior/L5
L3
L4
L5
L2 · Cybersecurity Analyst L2
0 reports
$141,700
total per year
Base
$56,680
Stock
$70,850
Bonus
$14,170
$99,190
$184,210
Interview experience
3 interviews
Difficulty
3.3
/ 5
Duration
14-28 weeks
Experience
Positive 0%
Neutral 67%
Negative 33%
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Virtual Onsite/Superday
5
Team Matching
6
Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
Latest updates
Tech Bulls Are Taking Charge of the Stock Market - Bloomberg
Bloomberg
News
·
2w ago
Batteries and Natural Gas Become Unlikely Companions - Bloomberg
Bloomberg
News
·
2w ago
IHeartMedia Holds Merger Talks With Sirius XM, Bloomberg News Reports - U.S. News Money
U.S. News Money
News
·
2w ago
China to restricts AI startups from taking U.S. funding, Bloomberg reports - Investing.com
Investing.com
News
·
2w ago