We are seeking an experienced and highly skilled Senior DevOps Engineering Lead with 7-10 years of progressive experience to drive our data platform infrastructure automation, CI/CD excellence, and operational efficiency. This role is pivotal in managing and scaling our resilient and performant systems, with a strong focus on supporting a microservices architecture. The ideal candidate will possess deep technical expertise in a wide range of DevOps tools and practices, coupled with strong leadership and problem-solving abilities.
**Key Responsibilities:**
- **CI/CD Pipeline Ownership:** Design, implement, and maintain robust, scalable, and secure CI/CD pipeline architecture for microservices applications, ensuring continuous integration, delivery, and deployment.
- **Infrastructure Planning & Management:** Lead the design, provisioning, optimization, and management of scalable data infrastructure (compute, storage, networking) across cloud, ECS, and/or on-premises environments, specifically supporting a data mesh paradigm.
- **Elastic Stack Expertise:** Manage and optimize Elastic Stack (Elasticsearch, Kibana, Logstash, Beats) for centralized logging, monitoring, and analytics.
- **Automation & Scripting:** Design, develop, and maintain automation scripts and tools using Shell scripts, Python, Java, or other relevant languages to streamline operational tasks and improve efficiency.
- **Infrastructure Procurement & Lifecycle:** Oversee the end-to-end Solution (SLTN) process for infrastructure procurement, ensuring timely and compliant acquisition of resources.
- **Capacity Estimation & Planning:** Conduct thorough capacity planning and performance analysis for microservices and underlying infrastructure to ensure scalability and reliability.
- **Access Management & Security:** Design and implement secure machine-to-machine communication strategies and manage infrastructure access, adhering to security best practices.
- **Microservices Operations:** Provide operational expertise and support for highly distributed microservices architectures, including troubleshooting, performance tuning, and incident response.
- **Governance & Observability:** Implement and enforce data governance policies through automation, and establish comprehensive observability (monitoring, logging, alerting) for data pipelines and infrastructure.
- **Mentorship & Best Practices:** Mentor junior DevOps engineers, promote DevOps best practices (e.g., IaC, GitOps, observability), and foster a culture of continuous improvement.
- **Cross-Functional Collaboration:** Work closely with partner development, QA, and infra security teams to ensure seamless integration and deployment processes.
**Qualifications:**
- Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
- 7-10 years of hands-on experience in DevOps, Site Reliability Engineering (SRE), or a similar role.
- **Proven expertise in DevOps leadership, including designing and implementing DevOps practices.**
- **In-depth experience with container orchestration platforms** such as AWS ECS, OpenShift, or Kubernetes.
- **Strong practical experience with Elastic Stack** (Elasticsearch, Kibana, Logstash, Beats) for transaction logging and monitoring.
- **Proficiency in scripting languages (Shell scripting and Python are a must) and strong Java development skills**, particularly for tooling and automation.
- Demonstrated knowledge of microservices architecture principles and operational challenges.
- Familiarity with machine-to-machine authentication and authorization mechanisms.
- Must have knowledge of automation principles and practices.
- Experience with job scheduling tools like Autosys.
- Familiarity with Helix Blueprint or similar automation frameworks.
- Excellent problem-solving, communication, and collaboration skills.
- **Modern Engineering Practices:** Familiarity with modern engineering practices such as Trunk-Based Development, Test-Driven Development (TDD), Behavior-Driven Development (BDD), Contract Testing, and Agile methodologies.
**Good to Have:**
- AWS Certifications (e.g., Solutions Architect, DevOps Engineer, SysOps Administrator).
- Experience with Kafka and Tibco messaging infrastructure, including best practices for messaging design and operations.
------------------------------------------------------
**Job Family Group:** Technology
**Job Family:** Applications Development
**Time Type:** Full time
**Most Relevant Skills:** Please see the requirements listed above.
**Other Relevant Skills:** For complementary skills, please see above and/or contact the recruiter.
*Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.*
*If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.*
*View Citi’s EEO Policy Statement and the Know Your Rights poster.*