Bloomberg

Global leader in business and financial data and analytics

Senior Software Engineer SRE Core Communications

Function: DevOps

Level: Senior

Work mode: On-site

Type: Full-time

Posted: 2 months ago

About Core Communications (CC):

We build the core messaging products that power Bloomberg’s internal and client communication: IB (Instant Bloomberg), MSG (Message), and other collaboration platforms. These systems are used by the financial industry to exchange billions of messages daily, from trade ideas and pricing quotes to mission-critical communications. We're building the backbone of financial dialogue, operating at massive scale and high stakes.

About our Team:

The Core Communications SRE team are the guardians of reliability and stability for all CC products. Our focus is on enabling teams to build and operate resilient, observable, and scalable systems. We define standards, provide tools, and lead reliability-focused initiatives across all stages of the development lifecycle. Our scope spans infrastructure, application health, and incident response, working closely with over 100 developers and multiple product and platform teams.

We view our systems holistically, from application code and cluster provisioning to monitoring pipelines and reliability governance. As our platforms evolve and scale, we proactively identify architectural and operational risks, and partner with teams to mitigate them. This includes defining meaningful SLOs with Product, strengthening our observability stack, and developing cross-cutting tools that improve diagnosis and response.

We’ll Trust You To:

Define and promote reliability-focused standards and best practices across observability, alerting, incident response, and provisioning
Build and maintain troubleshooting tools leveraging distributed tracing and health signals to accelerate root cause analysis
Partner with Product teams to define and measure meaningful SLOs aligned with user experience
Lead initiatives to identify and mitigate reliability risks across CC systems — spanning performance, capacity, and resiliency
Collaborate with developers to embed reliability into the software development lifecycle, from design through deployment
Help build a culture of reliability by advocating for failure-aware design and sharing best practices across teams
Develop automation to reduce manual operational effort and support scalable, safe growth of our infrastructure

What’s in it for you:

You’ll have a direct and visible impact on the stability, resilience, and scalability of Bloomberg’s most fundamental and critical products — IB and MSG, which are relied upon daily by the global financial industry for essential decision-making and communication. The work you do will directly shape the reliability experience of our clients and internal users alike.

This role gives you the autonomy to drive reliability initiatives end-to-end, from infrastructure design and tooling to rollout and adoption across engineering teams. You’ll play a key role in fostering a culture of reliability within Core Communications, influencing how systems are built, monitored, and maintained.

In your day-to-day, you’ll help create tooling and frameworks to define and track reliability metrics that guide long-term stability efforts across our platforms. You’ll collaborate with teams to implement distributed tracing and end-to-end health monitoring, enabling faster debugging and deeper visibility into system behavior. You’ll contribute to the development of libraries, dashboards, and automation that bring consistency to alerting, provisioning, and incident response across the broader CC organization. And you’ll help lead the adoption of chaos testing and failure injection practices to validate how our systems perform under real-world stress.

You’ll work closely with engineers, product managers, and SREs across multiple teams and regions — building deep technical expertise and a strong cross-functional network. We also support ongoing learning through conference attendance, industry engagement, and knowledge-sharing, so you can continue to grow and bring fresh perspectives back into the team.

You’ll need to have:

4+ years of experience in software engineering, including experience working on an SRE team
Proficiency in Python and proven experience with C++
Strong understanding of distributed systems and system reliability
Familiarity with SLOs, SLIs, and SLAs, and how to relate system performance back to client impact
Strong collaboration and communication skills
A degree in Computer Science, Engineering, or equivalent practical experience

We’d love to see:

Hands-on experience with monitoring and alerting tools (e.g., Grafana, Splunk, distributed tracing)
Experience with Kafka and Java
Experience with chaos engineering, failure injection, or resilience testing frameworks
Exposure to capacity planning and scaling analysis
An interest in treating security as part of reliability
Contributions to open source or involvement in SRE communities
Awareness of industry compliance frameworks (e.g., DORA, SOC 2) and how they relate to system reliability
Experience with big data technologies such as Apache Spark and Amazon S3

Salary range: 160,000 - 240,000 USD annually + benefits + bonus

About Bloomberg

Public

Bloomberg L.P. is an American privately held financial, software, data, and media company headquartered in Midtown Manhattan, New York City. It was co-founded by Michael Bloomberg in 1981, with Thomas Secunda, Duncan MacMillan, Charles Zegar, and a 12% ownership investment by Merrill Lynch.

Employees: 10,001+

Headquarters: Midtown Manhattan

Reviews

Rating: 4.0 (15 reviews)

Work-life balance: 4.2

Compensation: 4.5

Company culture: 3.2

Career development: 3.0

Management: 2.8

Recommendation rate: 65%

Pros

High, competitive total compensation

Good work-life balance

Company stability and job security

Cons

Slow career progression and promotion speed

Management issues and micromanagement

Limited remote work flexibility

Salary range

2,046 data points

Senior/L5

L2 · Cybersecurity Analyst L2

0 reports

Total annual compensation: $141,700

Base salary: $56,680

Stock: $70,850

Bonus: $14,170

$99,190 - $184,210

Interview reviews

3 reviews

Difficulty: 3.3 / 5

Duration: 14-28 weeks

Experience

Positive: 0%

Neutral: 67%

Negative: 33%

Interview process

Application Review

Recruiter Screen

Technical Phone Screen

Virtual Onsite/Superday

Team Matching

Offer

Common question types

Coding/Algorithm

System Design

Behavioral/STAR

Technical Knowledge

Past Experience