채용
필수 스킬
Docker
Kubernetes
Jira
Job Summary
As a Cloud Infrastructure/Site Reliability Engineer, you will be operating at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of cloud services - from design through deployment, operation, and refinement. You will be responsible for maintaining these services by measuring and monitoring their availability, latency, and overall system health.
You will play a crucial role in sustainably scaling systems through automation and driving changes that improve reliability and velocity. As part of your responsibilities, you will administer cloud-based environments that support our SaaS/IaaS offerings, which are implemented on a microservices, container-based architecture (Kubernetes).
In addition, you will oversee a portfolio of customer-centric cloud services (SaaS/IaaS), ensuring their overall availability, performance, and security. You will work closely with both Net App and cloud service provider teams, including those from Google, located across the globe in regions such as RTP, Reykjavík, Bangalore, Sunnyvale, Redmond, and more.
Due to the critical nature of the services we support, this position involves participation in a rotation-based on-call schedule as part of our global team. This role offers the opportunity to work in a dynamic, global environment, ensuring the smooth operation of vital cloud services. To be successful in this role, you should be a motivated self-starter and self-learner, possess strong problem-solving skills, and be someone who embraces challenges.
Job Requirements
- Incident Response and Troubleshooting: Address and perform root cause analysis (RCA) of complex live production incidents and cross-platform issues involving OS, Networking, and Database in cloud-based SaaS/IaaS environments. Implement SRE best practices for effective resolution.
- Analysis, and Infrastructure Maintenance: Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Stackdriver, Elastic Search, Grafana, and Solar Winds. Develop strategies to enhance system and application performance, availability, and reliability. In addition, maintain and monitor the deployment and orchestration of servers, docker containers, databases, and general backend infrastructure.
- Document system knowledge as you acquire it, create runbooks, and ensure critical system information is readily accessible.
- Security Management: Stay updated with security protocols and proactively identify, diagnose, and resolve complex security issues.
- Automation and Efficiency: Identify tasks and areas where automation can be applied to achieve time efficiencies and risk reduction. Develop software for deployment automation, packaging, and monitoring visibility.
- Issue Tracking and Resolution: Use Atlassian Jira, Google Buganizer, and Google IRM to track and resolve issues based on their priority.
- Team Collaboration and Influence: Work in tandem with other Cloud Infrastructure Engineers and developers to ensure maximum performance, reliability, and automation of our deployments and infrastructure. Additionally, consult and influence developers on new feature development and software architecture to ensure scalability.
- Debugging, Troubleshooting, and Advanced Support: Undertake debugging and troubleshooting of service bottlenecks throughout the entire software stack. Additionally, provide advanced tier 2 and 3 support for Net App's Cloud Data Services solutions.
- Directly influence the decisions and outcomes related to solution implementation: measure and monitor availability, latency, and overall system health.
- Proficiency in Linux/Unix and CORE OS.
- Demonstrated experience in scripting and infrastructure automation using tools such as Ansible, Python, Go or Ruby.
- Deep working knowledge of Containers, Kubernetes, and Serverless computing implementation.
- DevOps development methodologies.
- Experience with distributed systems design patterns using tools such as Kubernetes.
- Experience with cloud platforms such as AWS, Azure, or Google Cloud.
Education
- A minimum of 8-12 years of experience is required.
- A Bachelor of Science Degree in Computer Science, a master’s degree; or equivalent experience is required.
총 조회수
0
총 지원 클릭 수
0
모의 지원자 수
0
스크랩
0
비슷한 채용공고

Senior Developer
HCL Technologies · Bengaluru, India

Senior Staff Software Engineer (Backend)
Databricks · Bengaluru, India

Senior o9 APS / IBP Solution Developer – CoE
Juniper Networks · Bengaluru, Karnātaka, India

Senior Network Dev Engineer, Customer Experience Infrastructure (CXI)
Amazon · Bengaluru, KA, IND

PRINCIPAL ENGINEER D365
Wipro · Bengaluru, India
NetApp 소개
NetApp
PublicNetApp is a multinational computer storage and data management company that provides software, systems and services for managing enterprise data.
10,001+
직원 수
San Jose
본사 위치
$18.2B
기업 가치
리뷰
3.5
10개 리뷰
워라밸
4.0
보상
2.8
문화
4.2
커리어
2.5
경영진
2.7
65%
친구에게 추천
장점
Good work-life balance and flexible hours
Supportive management and colleagues
Diverse and inclusive environment
단점
Limited career advancement opportunities
Poor management and leadership direction
Pay and compensation issues
연봉 정보
48개 데이터
Mid/L4
Mid/L4 · Network Security Engineer
1개 리포트
$164,136
총 연봉
기본급
$143,182
주식
-
보너스
-
$164,136
$164,136
면접 경험
1개 면접
난이도
3.0
/ 5
면접 과정
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Coding Challenge
5
Hiring Manager Interview
6
Onsite/Virtual Interviews
자주 나오는 질문
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
System Design
Past Experience
뉴스 & 버즈
Is NetApp (NTAP) Offering Value After Strong Multi Year Share Price Performance - simplywall.st
simplywall.st
News
·
1d ago
NetApp Walks the AI Talk with Google - - Enterprise Times
Enterprise Times
News
·
1d ago
Enabling AI-powered analytics on enterprise file data: Configuring S3 Access Points for Amazon FSx for NetApp ONTAP with Active Directory - Amazon Web Services
Amazon Web Services
News
·
1d ago
Universal Beteiligungs und Servicegesellschaft mbH Increases Position in NetApp, Inc. $NTAP - MarketBeat
MarketBeat
News
·
1d ago