
Kubernetes Platform Engineer - AI Infrastructure (Golang/Python) - (8+ Years) at Cisco
About the role
Meet the Team
You will play a pivotal role on the team that designs and develops the next generation of scalable Kubernetes infrastructure and machine learning platforms, supporting both traditional ML and state-of-the-art Large Language Models (LLMs). This position is for experienced engineers: you will lead the technical direction, ensuring the performance, reliability, and scalability of AI systems while collaborating closely with data scientists, researchers, and other engineering teams.
Your Impact
You will take ownership of sophisticated, highly scalable Kubernetes platforms for microservices workloads. Your leadership will be pivotal in driving the adoption and integration of both established Kubernetes platforms and emerging AI/ML technologies. You will mentor junior engineers to reinforce the team’s core technical expertise, ensuring a strong foundation in traditional container orchestration as well as modern AI-driven solutions. This role is ideal for someone passionate about tackling engineering challenges in dynamic environments, with a commitment to delivering scalable, high-impact solutions that blend proven infrastructure methodologies with innovative AI/ML advancements.
Core Responsibilities
As a Platform Engineer with AI/ML experience, you will:
- Architect and design scalable Kubernetes platforms supporting both traditional ML models and Large Language Models (LLMs).
- Provide on-call and client support for all Kubernetes platforms.
- Participate in troubleshooting operational issues and drive upgrades.
- Apply proficiency in the Kubernetes (K8s) platform to design, develop, and maintain scalable software solutions.
- Drive cross-functional collaboration across infrastructure teams to ensure seamless integration and delivery of services.
- Engage directly with clients to gather IT requirements, translate business needs into technical solutions, and architect robust systems.
- Drive technical brainstorming sessions with technical teams to innovate and build effective architectures aligned with client goals.
- Act as a key technical liaison between clients and internal teams, ensuring clear communication and successful project outcomes.
- Provide design, implementation, and operational support for a traditional Kubernetes platform tailored for microservices architecture.
- Enhance and maintain the existing platform to reliably support a large portfolio of business applications.
- Automate platforms to operate as infrastructure as code, improving efficiency and consistency in platform management.
- Architect a GPU-as-a-Service platform offering and provide client support for hosting GPU-powered AI/ML workloads.
- Drive AIOps initiatives across PaaS platforms by collaborating with cross-functional teams, including SREs and software engineers, to operationalize and optimize ML models effectively.
- Develop infrastructure automation tools and frameworks to improve efficiency across teams.
- Ensure platform reliability, scalability, and performance through meticulous engineering practices.
- Conduct code reviews, establish standard processes, and mentor junior engineers.
- Stay updated on the latest trends in AI/ML to influence platform enhancements.
Minimum Qualifications / Requirements
- Experience: 8+ years of software engineering experience, including at least 2 years in machine learning-related roles.
- Expertise in Golang or Python and in the Kubernetes platform, along with ML frameworks (TensorFlow, PyTorch).
- Proven track record in designing and deploying scalable machine learning systems in production.
- Deep understanding of ML algorithms, data pipelines, and optimization techniques.
- Experience building CI/CD pipelines for ML workflows, including model monitoring and retraining.
- Proficiency in cloud platforms and orchestration tools for distributed systems.
- Strong problem-solving and debugging skills for complex, large-scale systems.
- Experience in mentoring engineers and driving technical decision-making.
Preferred Qualifications / Requirements
Kubernetes and Container Orchestration:
- Expertise in Kubernetes for managing enterprise-grade systems and ensuring scalability.
- Experience with Docker and orchestration of complex services.
- Software development: Expertise in Golang or Python.
- MLOps Tools and Frameworks: Experience with architecting and optimizing workflows using Kubeflow Pipelines, KServe, Airflow, and MLflow.
- Ability to design and implement efficient CI/CD pipelines for ML systems.
- Large Language Models (LLMs): Understanding of LangChain and experience designing RAG systems.
- Knowledge of integrating and scaling vector databases (e.g., Pinecone, FAISS) for real-world applications.
Why Cisco?
At Cisco, we’re revolutionizing how data and infrastructure connect and protect organizations in the AI era – and beyond. We’ve been innovating fearlessly for 40 years to create solutions that power how humans and technology work together across the physical and digital worlds. These solutions provide customers with unparalleled security, visibility, and insights across the entire digital footprint.
Fueled by the depth and breadth of our technology, we experiment and create meaningful solutions. Add to that our worldwide network of doers and experts, and you’ll see that the opportunities to grow and build are limitless. We work as a team, collaborating with empathy to make really big things happen on a global scale. Because our solutions are everywhere, our impact is everywhere.
We are Cisco, and our power starts with you.
Required skills
Kubernetes
Platform engineering
AI infrastructure
Operations
Troubleshooting
Scalability
Mentoring
Client support
About Cisco

Cisco (Public)
Cisco Systems, Inc. is an American multinational technology conglomerate corporation that develops, manufactures, and sells hardware, software, telecommunications equipment and other high-technology services and products focused on networking, cyber security and AI.
- Employees: 10,001+
- Headquarters: San Jose
- Valuation: $317B
Reviews
4.3 (10 reviews)
- Work-life balance: 3.5
- Compensation: 4.2
- Culture: 4.6
- Career: 3.8
- Management: 4.0
78% recommend to a friend
Pros
- Supportive and friendly team culture
- Flexible work arrangements and remote options
- Excellent benefits and competitive compensation
Cons
- High-pressure and demanding work environment
- Work-life balance challenges
- Limited career advancement opportunities
Interview experience
4 interviews
- Difficulty: 3.0 / 5
- Duration: 14-28 weeks
- Experience: Positive 0%, Neutral 25%, Negative 75%
Interview process
1. Application Review
2. Phone Screen
3. Technical Interview Round 1
4. Technical Interview Round 2
5. Behavioral Interview
6. Team Matching
7. Final Round
Common questions
- Coding/Algorithm
- System Design
- Behavioral/STAR
- Technical Knowledge