Site Reliability Engineer (OpenSearch)
About the role
Job Summary
Net App is seeking a Technical Operations Engineer (Open Search) to join our growing Instaclustr team in Bangalore, India. In this role, you will be part of a frontline Site Reliability Engineering (SRE) team responsible for ensuring the availability, performance, and reliability of large-scale, cloud-hosted Open Search clusters.
You will work in a highly automated environment managing distributed open-source systems at scale, collaborating with global customers across industries such as banking, telecom, gaming, and technology. This role requires strong operational expertise, problem-solving skills, and a passion for learning and working with modern cloud-native and open-source technologies.
Job Requirements
- Provide end-to-end operational support for Open Search clusters deployed across public cloud platforms (AWS, Azure, GCP).
- Monitor, troubleshoot, and resolve complex production issues, ensuring high availability and performance.
- Perform cluster lifecycle operations, including upgrades, migrations, maintenance, and scaling activities.
- Participate in L2 on-call rotations, ensuring timely incident response and resolution.
- Collaborate with customer engineering teams to diagnose and resolve issues related to Open Search and other supported technologies.
- Work closely with internal teams to enhance reliability, automation, and operational efficiency.
- Develop and improve automation tools, scripts, and operational processes.
- Analyse system behaviour and proactively identify opportunities for performance optimisation and reliability improvements.
- Contribute to knowledge sharing, documentation, and continuous improvement initiatives.
Required Skills & Experience
- Hands-on experience with Open Search (including troubleshooting, upgrades, and migrations) or strong willingness to develop deep expertise.
- Experience with public cloud platforms such as AWS, Azure, or GCP.
- Strong Linux system administration skills and comfort with command-line environments.
- Solid understanding of distributed systems, networking, and OS internals.
- Experience with containerisation technologies (e.g., Docker).
- Strong problem-solving skills with the ability to debug complex production issues.
- Excellent communication skills (written and verbal) with a customer-focused mindset.
- Ability to work effectively in a collaborative, fast-paced environment and take ownership of tasks.
Preferred Skills
- Experience working with other distributed systems such as Cassandra or Kafka.
- Familiarity with source code debugging and issue investigation (e.g., Jira, codebase review).
- Programming/scripting skills in Python, Java, or Bash.
- Experience with Git or version control systems.
- Prior experience in customer support or technical operations roles
Education
- Typically requires a minimum of 4-8 years of related experience with a Bachelor’s degree or 6 years and a Master’s degree; or a PhD with 3 years experience; or equivalent experience.
At Net App, we embrace a hybrid working environment designed to strengthen connection, collaboration, and culture for all employees. This means that most roles will have some level of in-office and/or in-person expectations, which will be shared during the recruitment process.
Equal Opportunity Employer:
Net App is firmly committed to Equal Employment Opportunity (EEO) and to compliance with all federal, state and local laws that prohibit employment discrimination based on age, race, color, gender, sexual orientation, gender identity, national origin, religion, disability or genetic information, pregnancy, protected veteran status, and any other protected classification.
Why You'll Thrive at Net App
At Net App, you won't wait for the perfect moment—you'll make it. The early planning, the extra thought, the bold idea that turns good into great: That's how our people operate and how we continue to push the boundaries of data infrastructure.
Net App is the trusted partner for organizations transforming data into opportunity. As the only enterprise-grade storage service natively embedded in Google Cloud, AWS, and Microsoft Azure, we empower customers to run everything from traditional workloads to enterprise AI with unmatched performance, resilience, and security.
Our culture
We celebrate mold breakers, bold thinkers, and problem solvers. We reward initiative, impact, and ownership. We provide flexibility so you can balance professional ambition with your personal life. Here, differences are not just welcomed—they drive everything we do.
If you're ready to innovate, rise to the challenge, and own every moment - make your next move your best one. Apply now.
Benefits and perks
•Learning Budget
Required skills
site reliability engineering
OpenSearch
cloud operations
incident response
automation
monitoring
troubleshooting
About NetApp
Bangalore
Headquarters