
Kafka Admin
About the role
We are looking for an experienced Senior Kafka Administrator to lead the design, implementation, and management of highly scalable and resilient Kafka ecosystems. This role requires deep expertise in distributed systems, strong troubleshooting capabilities, and the ability to architect enterprise-grade streaming platforms. You will play a key role in ensuring system reliability, optimizing performance, and guiding teams on best practices for real-time data streaming.
Responsibilities
Design, deploy, and manage large-scale Kafka clusters across on-premises and cloud environments
Lead architecture decisions for high availability, scalability, and fault tolerance
Monitor, troubleshoot, and tune Kafka clusters for performance and uptime
Handle capacity planning, cluster sizing, and performance tuning
Manage advanced configurations including multi-cluster replication (MirrorMaker), tiered storage, and KRaft mode
Configure and enforce security best practices (SASL, SSL/TLS, RBAC, ACLs)
Implement robust backup, disaster recovery, and failover strategies
Collaborate with engineering teams to design efficient event-driven architectures and streaming pipelines
Resolve complex production issues and lead root cause analysis (RCA)
Automate infrastructure provisioning and operational tasks using tools like Terraform, Ansible, or scripts
Lead Kafka upgrades, migrations, and platform enhancements with minimal downtime
Establish and maintain monitoring, alerting, and logging frameworks
Mentor junior engineers and provide technical leadership
Create and maintain detailed documentation and operational runbooks
Primary skills: Technology->Java->Apache
Requirements
5–9 years of IT experience, including at least 3 years in Kafka administration/engineering
Strong expertise in Apache Kafka internals (brokers, partitions, replication, ISR)
Deep understanding of distributed systems, messaging frameworks, and event streaming
Experience with Kafka ecosystem tools: Kafka Connect, Kafka Streams, Schema Registry
Hands-on experience with monitoring tools like Prometheus, Grafana, ELK, or Splunk
Strong Linux/Unix administration skills
Experience with cloud platforms (AWS MSK, Azure Event Hubs for Apache Kafka, GCP Pub/Sub with Kafka integration)
Familiarity with containerization and orchestration (Docker, Kubernetes)
Expertise in scripting/programming (Python, Bash, or Java preferred)
Strong problem-solving skills with the ability to handle critical incidents
Bachelor’s or Master’s degree in Computer Science, IT, or a related field (or equivalent practical experience)
Education: Bachelor of Engineering
Preferred skills: Technology->Java->Apache->Kafka
Required skills
Apache Kafka
Kafka Connect
Kafka Streams
Schema Registry
MirrorMaker
KRaft
Terraform
Ansible
About Infosys
HYDERABAD
Headquarters