
Focusing on consulting, software, and technology.
Lead AI Engineer
ZS is a place where passion changes lives. As a management consulting and technology firm focused on improving life and how we live it, we transform ideas into impact by bringing together data, science, technology and human ingenuity to deliver better outcomes for all. Here you’ll work side-by-side with a powerful collective of thinkers and experts shaping life-changing solutions for patients, caregivers and consumers, worldwide. ZSers drive impact by bringing a client-first mentality to each and every engagement. We partner collaboratively with our clients to develop custom solutions and technology products that create value and deliver company results across critical areas of their business. Bring your curiosity for learning, bold ideas, courage and passion to drive life-changing impact to ZS.
What you'll do: Lead AI Engineer in the Platforms and Products will…
We are seeking a highly motivated Applied AI Engineer with a strong foundation in Machine Learning and a deep interest in **Large Language Models (LLMs)**and Generative AI. This role focuses on building, optimizing, and evaluating production-grade LLM systems, including Retrieval-Augmented Generation (RAG), fine-tuning workflows, and scalable inference pipelines.
- Design and implement LLM-powered applications using state-of-the-art transformer models.
- Build and optimize RAG pipelines using embeddings, chunking strategies, and vector search.
- Experiment with prompt engineering,structured outputs(JSON schemas/function calling), and tool-augmented LLMs (agents/workflows).
- Fine-tune models using techniques such as LoRA,PEFT, and instruction tuning.
- Develop and evaluate embedding models for similarity search and semantic retrieval.
- Conduct LLM evaluation using automated and human-in-the-loop techniques (offline + online).
- Optimize inference workflows for latency,GPU utilization, and cost efficiency (quantization, batching, caching).
- Build and maintain REST API Services (FastAPI etc.) to deploy LLM/RAG endpoints, integrate with product systems, and support scalable inference.
- Contribute to integration of AI systems into production software environments (CI/CD, monitoring, reliability).
- Research and prototype cutting-edge approaches in Generative AI and share learnings with the team.
What you’ll bring:
- A master's or bachelor's degree in Computer Science or related field from a top university
- 4+ years' hands-on experience in Machine Learning (ML) with production LLM systems
- Good fundamentals of machine learning, deep learning and fine tuning models (LLM) including:Understanding of transformer architectures
- Prompt engineering expertise
- Embeddings and vector search
- Experienced in backend API design with FastAPI, async patterns, rate limiting
- Experience with vector DB including:Pinecone, Weaviate, or Chroma
- Embedding storage and similarity search
- Hybrid search implementations
- Strong programming expertise in Python is must including:Async programming (asyncio, async/await)
- Type hints and Pydantic
- SOLID principles and design patterns
- Experience in ML Ops to measure and track model performance including:MLFlow for model tracking
- Langfuse for LLM observability (strongly preferred)
- Model versioning and A/B testing
- Experience in working with NLP & computer vision
- Fluency in English
- Client-first mentality
- Intense work ethic
- Collaborative spirit and problem-solving approach
At ZS, your growth matters. We offer a comprehensive total rewards package that supports your health and well‑being, financial future, time away, and professional development. With robust skills‑building programs, multiple career progression paths, internal mobility, and a deeply collaborative culture, you’ll have the opportunity to do meaningful work, expand your capabilities, and thrive as part of a global community. For details on total rewards in United States, visit ZS US office locations | Where we work | ZS.
What you'll do: Lead AI Engineer in the Platforms and Products will…
We are seeking a highly motivated Applied AI Engineer with a strong foundation in Machine Learning and a deep interest in **Large Language Models (LLMs)**and Generative AI. This role focuses on building, optimizing, and evaluating production-grade LLM systems, including Retrieval-Augmented Generation (RAG), fine-tuning workflows, and scalable inference pipelines.
- Design and implement LLM-powered applications using state-of-the-art transformer models.
- Build and optimize RAG pipelines using embeddings, chunking strategies, and vector search.
- Experiment with prompt engineering,structured outputs(JSON schemas/function calling), and tool-augmented LLMs (agents/workflows).
- Fine-tune models using techniques such as LoRA,PEFT, and instruction tuning.
- Develop and evaluate embedding models for similarity search and semantic retrieval.
- Conduct LLM evaluation using automated and human-in-the-loop techniques (offline + online).
- Optimize inference workflows for latency,GPU utilization, and cost efficiency (quantization, batching, caching).
- Build and maintain REST API Services (FastAPI etc.) to deploy LLM/RAG endpoints, integrate with product systems, and support scalable inference.
- Contribute to integration of AI systems into production software environments (CI/CD, monitoring, reliability).
- Research and prototype cutting-edge approaches in Generative AI and share learnings with the team.
What you’ll bring:
- A master's or bachelor's degree in Computer Science or related field from a top university
- 4+ years' hands-on experience in Machine Learning (ML) with production LLM systems
- Good fundamentals of machine learning, deep learning and fine tuning models (LLM) including:Understanding of transformer architectures
- Prompt engineering expertise
- Embeddings and vector search
- Experienced in backend API design with FastAPI, async patterns, rate limiting
- Experience with vector DB including:Pinecone, Weaviate, or Chroma
- Embedding storage and similarity search
- Hybrid search implementations
- Strong programming expertise in Python is must including:Async programming (asyncio, async/await)
- Type hints and Pydantic
- SOLID principles and design patterns
- Experience in ML Ops to measure and track model performance including:MLFlow for model tracking
- Langfuse for LLM observability (strongly preferred)
- Model versioning and A/B testing
- Experience in working with NLP & computer vision
- Fluency in English
- Client-first mentality
- Intense work ethic
- Collaborative spirit and problem-solving approach
At ZS, your growth matters. We offer a comprehensive total rewards package that supports your health and well‑being, financial future, time away, and professional development. With robust skills‑building programs, multiple career progression paths, internal mobility, and a deeply collaborative culture, you’ll have the opportunity to do meaningful work, expand your capabilities, and thrive as part of a global community. For details on total rewards in United States, visit ZS US office locations | Where we work | ZS.
전체 조회수
0
전체 지원 클릭
0
전체 Mock Apply
0
전체 스크랩
0
비슷한 채용공고

AI LEAD L1
Wipro · Chennai, India

Lead Solution Engineer, DTS Analytics
Kimberly-Clark · IT Centre Bengaluru GDTC

Engineering - Cloud Development - Software Engineer - Vice President - Dallas
Goldman Sachs · Dallas, Texas, United States

Applied AI ML Director Machine Learning Center of Excellence
JPMorgan Chase · New York, NY, United States, US

Lead Agentic Engineer
Marsh McLennan · Dublin - Charlotte
ZS Associates 소개

ZS Associates
BootstrappedZS Associates is a management consulting and professional services firm focusing on consulting, software, and technology. Headquartered in Evanston, Illinois, it provides healthcare, private equity, and technology services.
10,001+
직원 수
Evanston
본사 위치
리뷰
10개 리뷰
4.4
10개 리뷰
워라밸
3.2
보상
4.1
문화
4.5
커리어
4.2
경영진
4.0
78%
지인 추천률
장점
Supportive and approachable management
Good compensation and benefits
Training and career development opportunities
단점
Long hours and heavy workload during peak projects
High performance pressure and expectations
Fast-paced competitive environment
연봉 정보
1,722개 데이터
Mid/L4
Senior/L5
Staff/L6
Director
Mid/L4 · Advanced Data Science Associate
8개 리포트
$175,000
총 연봉
기본급
$175,000
주식
-
보너스
-
$163,540
$245,440
면접 후기
후기 2개
난이도
3.5
/ 5
소요 기간
14-28주
경험
긍정 0%
보통 50%
부정 50%
면접 과정
1
Application Review
2
HR Screen
3
Technical/Case Interview
4
Panel Interview
5
Offer
자주 나오는 질문
Technical Knowledge
Case Study
Behavioral/STAR
Past Experience
Problem Solving
최근 소식
ZS Associates: Revolutionizing Data Engineering with Self-Service Platform on AWS for Life Sciences - Amazon Web Services
Amazon Web Services
News
·
5w ago
ZS Associates Expands to 28,000 SQFT Across Two Bellevue CBD Locations - The Registry Pacific Northwest Real Estate
The Registry Pacific Northwest Real Estate
News
·
5w ago
Insights at Scale With Agentic AI: How AI Ready Data is Transforming Commercial Life Sciences - Databricks
Databricks
News
·
5w ago
ZS Associates Data Engineer Interiview
Hi guys I have an interview set up for Business Technology Solutions Associate Consultant R&D for Data Engineer role. If anyone appeared for interview recently or currently working there can tell about the interview experience, I would really appreciate it. Thanks
·
6w ago
·
19
·
11