
Gen AI / Agentic AI Lead
About the role
Infosys is seeking a hands-on Gen AI / Agentic AI Lead to drive the development and deployment of next-generation AI solutions using Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and Agentic AI frameworks. This role is ideal for a mid-level engineer with strong technical depth, a passion for building, and the ability to lead small teams or workstreams in a fast-paced, innovation-driven environment.
Required Qualifications
- Bachelor's degree or foreign equivalent required from an accredited institution. Will also consider three years of progressive experience in the specialty in lieu of every year of education.
- 4 years of experience in software engineering or data science, with 2-3 years in Gen AI or LLM-based systems.
- Strong Python programming skills and experience with ML/AI libraries (Hugging Face Transformers, LangChain, PyTorch).
- Hands-on experience with vector databases (FAISS, Pinecone, Weaviate, Azure AI Search).
- Familiarity with cloud platforms and Gen AI services (AWS, Azure, GCP).
- Experience with REST API development (FastAPI, Flask) and containerization (Docker).
- Solid understanding of AI governance, model safety, and prompt engineering.
- Candidates must be located in Bridgewater, NJ; Sunnyvale, CA; Austin, TX; Raleigh, NC; Richardson, TX; Tempe, AZ; Phoenix, AZ; Charlotte, NC; Houston, TX; Denver, CO; Hartford, CT; New York, NY; Palm Beach, FL; Tampa, FL; or Alpharetta, GA, or be willing to relocate.
- Candidates authorized to work for any employer in the United States without employer-based visa sponsorship are welcome to apply. Infosys is unable to provide immigration sponsorship for this role at this time.
Key Responsibilities
- Design, develop, and deploy Gen AI applications using LLMs and agentic frameworks (e.g., LangGraph, AutoGen, CrewAI).
- Fine-tune open-source and proprietary LLMs using techniques like LoRA, QLoRA, and PEFT.
- Build and optimize RAG pipelines with hybrid retrieval, semantic chunking, and vector search.
- Integrate Gen AI solutions with cloud-native services (AWS Bedrock, Azure OpenAI, GCP Vertex AI).
- Work with unstructured data (PDFs, HTML, audio, images) and multimodal models.
- Implement LLMOps practices including prompt versioning, caching, observability, and cost tracking.
- Evaluate model performance using tools like RAGAS, DeepEval, and FMEval.
- Collaborate with product managers, data engineers, and UX teams to deliver production-ready solutions.
- Mentor junior engineers and contribute to code reviews, design discussions, and best practices.
Preferred Qualifications
- Exposure to agentic workflows and autonomous agents.
- Experience with CI/CD pipelines and DevOps tools (GitHub Actions, Jenkins, Terraform).
- Familiarity with front-end integration (React, Angular, TypeScript) and GraphQL APIs.
- Knowledge of model interpretability, bias mitigation, and human-in-the-loop systems.
- Experience with multimodal models and perception systems (e.g., vision + language).
The job entails sitting as well as working at a computer for extended periods of time. Candidates should be able to communicate by telephone, email, or face-to-face.
Estimated annual compensation range for candidates in the below locations will be:
Sunnyvale, CA; Bridgewater, NJ; New York, NY; Denver, CO: $73,000 to $122,275
Along with competitive pay, as a full-time Infosys employee, you are also eligible for the following benefits:
- Medical/Dental/Vision/Life Insurance
- Long-term/Short-term Disability
- Health and Dependent Care Reimbursement Accounts
- Insurance (Accident, Critical Illness, Hospital Indemnity, Legal)
- 401(k) plan and contributions dependent on salary level
- Paid holidays plus Paid Time Off