
Hadoop Admin
About the role
We are seeking a skilled Hadoop Administrator to manage and maintain our Hadoop ecosystem, ensuring high availability, performance, and security of big data platforms. The ideal candidate will have hands-on experience in cluster management, monitoring, and troubleshooting within distributed environments.
Install, configure, and maintain Hadoop clusters (HDFS, YARN, Map Reduce).
Administer Hadoop ecosystem components such as Hive, HBase, Spark, Sqoop, Kafka, and Oozie.
Monitor cluster health, performance, and storage capacity using tools like Ambari, Cloudera Manager, or similar.
Configure and manage HDFS storage, replication, and data balancing.
Implement and maintain security policies (Kerberos authentication, encryption, access controls).
Perform cluster upgrades, patching, and capacity planning.
Troubleshoot issues related to cluster performance, jobs, and data processing failures.
Manage data ingestion and pipeline workflows.
Ensure high availability and disaster recovery for Hadoop clusters.
Collaborate with data engineers and architects for performance optimization and system improvements.
Maintain documentation for cluster configurations and processes.
2–5 years of experience in Hadoop Administration / Big Data Platforms.
Strong knowledge of Hadoop ecosystem components (HDFS, YARN, Hive, Spark).
Hands-on experience with cluster management tools (Cloudera Manager, Ambari).
Good understanding of Linux/Unix systems administration.
Proficiency in shell scripting for automation and monitoring.
Experience with performance tuning, troubleshooting, and cluster optimization.
Familiarity with data ingestion tools (Sqoop, Kafka, Flume).
Knowledge of distributed computing concepts and data storage systems.
Understanding of networking and system architecture in distributed environments.
Certifications such as Cloudera Certified Administrator (CCA) or equivalent.
Experience with cloud-based Hadoop platforms (Azure HDInsight, AWS EMR, GCP Dataproc).
Knowledge of DevOps practices and CI/CD pipelines.
Exposure to containerization tools (Docker, Kubernetes) is a plus.
Familiarity with data warehousing and ETL processes.
Infosys is a global leader in next-generation digital services and consulting. We enable clients in more than 50 countries to navigate their digital transformation. With over four decades of experience in managing the systems and workings of global enterprises, we expertly steer our clients through their digital journey. We do it by enabling the enterprise with an AI-powered core that helps prioritize the execution of change. We also empower the business with agile digital at scale to deliver unprecedented levels of performance and customer delight. Our always-on learning agenda drives their continuous improvement through building and transferring digital skills, expertise, and ideas from our innovation ecosystem. Infosys provides equal employment opportunities to applicants and employees without regard to race, color, sex, gender identity; sexual orientation, religious practices and observances; national origin; pregnancy, childbirth, or related medical conditions; status as a protected veteran or spouse/family member of a protected veteran; or disability.
Education: Bachelor of Engineering
- Preferred skills: Technology->Big Data
- Hadoop->Hadoop Administration
Benefits and perks
•Learning Budget
Required skills
Hadoop
HDFS
YARN
Hive
Spark
Linux
Shell scripting
Cluster management
About Infosys
BANGALORE
Headquarters