招聘

Site Reliability Engineering Lead, Vice President
NEW YORK, New York, United States of America
·
On-site
·
Full-time
·
1mo ago
We are seeking an experienced and motivated team member to support our AI and DevOps Platform Support team in North America. This role is responsible for contributing to the stability, reliability, and performance of our critical AI and DevOps platforms. The team supports a wide range of services, including multiple AI applications, developer tools, and CI/CD pipeline technologies used across the organization. The ideal candidate will help lead a team of SRE and Support engineers, facilitate incident and problem resolution, and collaborate with engineering and development teams to enhance platform services and supportability. The role includes short‑term planning and coordination of actions and resources within the team.
Responsibilities:
- Demonstrates a strong understanding of how application support contributes to the overall technology function and organizational objectives.
- Assist with vendor relationship management, including coordination with offshore managed services.
- Support efforts to improve service levels for end users by enhancing operational efficiencies and strengthening incident management, problem management, and knowledge‑sharing practices.
- Partner with development teams to guide improvements in application stability and supportability.
- Contribute to frameworks for managing capacity, throughput, and latency.
- Assist in defining and implementing application onboarding guidelines and standards.
- Support team members by fostering a collaborative environment and encouraging skill development.
- Participate in cost‑reduction efforts through Root Cause Analysis reviews, knowledge management, performance tuning, and user training.
- Participate in business review meetings to help align technology tools and strategies with business requirements.
- Ensure adherence to support processes and tool standards, and assist in enhancing processes to promote consistency and quality across the support program.
- Perform other duties and functions as assigned.
- Support platform leadership in defining the platform roadmap and partnering with engineering teams and business stakeholders.
- Assist in executing resilience activities such as wargaming scenarios, chaos engineering tests, and disaster recovery drills.
- Contribute to automation initiatives aimed at reducing manual toil and improving platform efficiency.
- Support the enterprise‑wide observability strategy, including monitoring, logging, tracing, and alerting.
- Maintain hands‑on familiarity with platform architecture and services as needed for operational support.
- Assist in overseeing the operational health of production platforms (including Open Shift, ECS, CI/CD), ensuring SLAs are supported and incident processes are followed.
- Help implement and operate effective monitoring and observability strategies to support proactive issue detection and system health assessments.
Qualifications:
- 6–10 years of relevant experience in a hands‑on technical or support leadership role.
- Experience contributing to architecture discussions and ensuring solutions align with enterprise standards and long‑term maintainability.
- Experience working with senior stakeholders or technology partners.
- Demonstrated experience supporting IT service improvements or platform stability initiatives.
- Strong communication and presentation skills, with the ability to convey technical concepts clearly.
- Experience supporting or contributing to technical roadmaps or operational workstreams.
- Experience participating in resilience‑related activities such as incident simulations, disaster recovery exercises, or stability testing.
- Ability to collaborate with cross‑functional support teams and technology groups.
- Strong organizational and workload‑planning skills.
- Consistently demonstrates clear and concise written and verbal communication skills.
- Ability to communicate appropriately with relevant stakeholders.
- Working knowledge of Generative AI concepts preferred.
- Experience with CI/CD and configuration management tools preferred.
- Experience with Red Hat Open Shift or similar Kubernetes technologies preferred.
- Experience working with databases such as Postgres, Oracle, MongoDB, or Redis preferred.
- Experience writing or maintaining code in Java, Python, Go, or similar languages preferred.
- Hands‑on experience with modern observability and monitoring tools (e.g., Prometheus, Grafana, Splunk, ELK) preferred.
Education:
- Bachelor’s/University degree required; Master’s degree preferred.
Job Family Group:
Technology
Job Family:
Applications Support
Time Type:
Full time
Primary Location:
New York New York United States:
Primary Location Full Time Salary Range:
$142,320.00 - $213,480.00
In addition to salary, Citi’s offerings may also include, for eligible employees, discretionary and formulaic incentive and retention awards. Citi offers competitive employee benefits, including: medical, dental & vision coverage; 401(k); life, accident, and disability insurance; and wellness programs. Citi also offers paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays. For additional information regarding Citi employee benefits, please visit citibenefits.com. Available offerings may vary by jurisdiction, job level, and date of hire.
Most Relevant Skills
Please see the requirements listed above.
Other Relevant Skills
For complementary skills, please see above and/or contact the recruiter.
Anticipated Posting Close Date:
May 15, 2026
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
总浏览量
1
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Manager, Cloud Infrastructure Demand & Capacity Planning
Apple · Cupertino, CA

Lead Software Engineer-AI Platform Engineer
JPMorgan Chase · Jersey City, NJ, United States, US

Director, Observability Platform Engineering Technical Lead
Fidelity · Merrimack, New Hampshire, USA

Lead Site Reliability Engineer
JPMorgan Chase · New York, NY, United States, US

Senior Manager, Site Reliability Engineering (FedRAMP) - ThousandEyes
Cisco · 3 Locations
关于Citigroup

Citigroup
PublicCitigroup Inc. or Citi is an American multinational investment bank and financial services company based in New York City. The company was formed in 1998 by the merger of Citicorp, the bank holding company for Citibank, and Travelers; Travelers was spun off from the company in 2002.
10,001+
员工数
New York City
总部位置
$86B
企业估值
评价
3.7
10条评价
工作生活平衡
4.0
薪酬
2.8
企业文化
4.2
职业发展
3.5
管理层
3.3
68%
推荐给朋友
优点
Good work-life balance
Supportive management and colleagues
Good benefits
缺点
Low/uncompetitive salary and pay
Poor management and lack of direction
Heavy workload and long hours
薪资范围
38个数据点
Mid/L4
Senior/L5
Staff/L6
Mid/L4 · Business Risk Intermediate Analyst
1份报告
$77,165
年薪总额
基本工资
$67,100
股票
-
奖金
-
$77,165
$77,165
面试经验
3次面试
难度
3.3
/ 5
时长
14-28周
体验
正面 0%
中性 33%
负面 67%
面试流程
1
Application Review
2
HR Screen
3
Technical Assessment
4
Hiring Manager Interview
5
Final Round Interview
6
Offer Decision
常见问题
Technical Knowledge
Behavioral/STAR
Past Experience
Problem Solving
Culture Fit
新闻动态
Citigroup Tokenized Stock (Ondo): Latest News, Social Media Updates and Insights - CryptoRank
CryptoRank
News
·
3d ago
Citigroup Inc. $C Stock Position Raised by Merit Financial Group LLC - MarketBeat
MarketBeat
News
·
3d ago
Top Citigroup Insiders Quietly Cash Out Millions in Stock Sales - TipRanks
TipRanks
News
·
3d ago
Citigroup (C) Valuation Check After Strong Q1 Earnings Beat And Decade High Quarterly Revenue - Yahoo Finance
Yahoo Finance
News
·
4d ago