Infosys
Infosys

Hadoop Admin

RoleData Engineering
LevelSenior
LocationBangalore, India
WorkOn-site
TypeFull-time
Posted2 months ago
Apply now

About the role

We are seeking a skilled Hadoop Administrator with strong hands-on experience in managing Hadoop ecosystems, specifically Presto and Hive Metastore (HMS). The role focuses on platform administration, performance tuning, security, monitoring, and ensuring high availability of big data platforms supporting enterprise analytics and reporting workloads.

Administer, configure, and maintain Hadoop clusters (HDFS, YARN, Hive)
Manage and support Presto clusters for interactive and federated query execution
Administer Hive Metastore (HMS) including schema management, metadata consistency, and performance tuning
Perform installation, upgrades, patching, and configuration changes for Hadoop ecosystem components
Monitor cluster health, resource utilization, and query performance
Troubleshoot performance issues related to Presto queries, metadata latency, and Hadoop services
Manage security configurations including:

Kerberos authentication
Role-based access control
Ranger / Sentry integration (if applicable)

Ensure high availability, backup, and disaster recovery for Hadoop services
Support data ingestion and integration pipelines relying on Hive and Presto
Collaborate with data engineering, analytics, and BI teams
Capacity planning and optimization of storage and compute resources
Create and maintain platform documentation, SOPs, and operational runbooks
Handle production support activities and on-call responsibilities as required

5+ years of experience in Hadoop administration
Strong hands-on experience with:

HDFS, YARN, Hive
Presto (cluster setup, tuning, troubleshooting)
Hive Metastore (HMS) administration

Good understanding of metadata management and data lake architectures
Experience with Linux system administration
Familiarity with cluster monitoring tools (Ambari, Cloudera Manager, Prometheus, Grafana)
Strong troubleshooting, performance tuning, and root-cause analysis skills
Knowledge of SQL and distributed query processing

Experience with Trino (formerly PrestoSQL)
Exposure to cloud-based Hadoop platforms (AWS EMR, Azure HDInsight, GCP Dataproc)
Experience with orchestration tools such as Airflow or Oozie
Knowledge of data governance tools (Ranger, Atlas)
Scripting experience (Shell, Python)
Experience with BI tools consuming Presto (Tableau, Power BI, Superset)

Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem. Infosys provides equal employment opportunities to applicants and employees without regard to race, color, sex, gender identity; sexual orientation, religious practices and observances; national origin; pregnancy, childbirth, or related medical conditions; status as a protected veteran or spouse/family member of a protected veteran; or disability.

Education: Bachelor of Engineering

  • Preferred skills: Technology->Big Data
  • Hadoop->Hadoop Administration

Benefits and perks

Learning Budget

Required skills

Hadoop

HDFS

YARN

Hive

Presto

Hive Metastore

Security

Performance tuning

About Infosys

BANGALORE

Headquarters