채용
Work Flexibility: Hybrid or Onsite
Vocera, now part of Stryker, is seeking a visionary and hands-on Principal Engineer – AI Test, Evaluation & Data Architecture to define and lead the enterprise-wide strategy for AI validation, model evaluation, and data governance across our speech and GenAI platforms.
This role serves as the AI Quality Architect for real-time speech systems, NLP pipelines, and LLM-powered applications deployed in mission-critical healthcare environments. You will establish scalable evaluation frameworks, design AI testing platforms, define data governance standards, and ensure production reliability of AI systems at scale.
This is a high-impact architectural leadership role requiring deep expertise in LLM validation, RAG evaluation, speech benchmarking, automation, MLOps, and AI lifecycle governance.
What You Will Do
Enterprise AI Evaluation Architecture
-
Define and own the end-to-end AI evaluation architecture across speech, NLP, and GenAI platforms.
-
Establish standardized evaluation frameworks for:
ASR systems (WER, latency, robustness, domain adaptation),
NLP systems (intent accuracy, entity F1, confusion analysis),
LLM systems (hallucination rate, groundedness, factual accuracy, consistency, safety)
-
Define measurable AI quality SLAs and release gating criteria.
-
Architect benchmarking standards across model versions, prompt changes, and retrieval updates.
-
Institutionalize regression evaluation pipelines for all AI releases.
LLM & RAG Reliability Strategy
- Architect validation frameworks for:
RAG-based systems,
Prompt orchestration workflows,
Multi-agent or multi-model AI pipelines
-
Define groundedness measurement strategies for enterprise RAG.
-
Establish adversarial testing, stress testing, and edge-case validation frameworks.
-
Implement hallucination detection standards and mitigation measurement.
-
Drive responsible AI practices, including bias detection and safety validation.
AI Testing Platform & Automation Architecture
- Design and lead implementation of a scalable AI testing platform that includes:
Offline evaluation pipelines,
Golden dataset-driven regression systems,
Synthetic data generation frameworks,
Online A/B testing & shadow deployment strategies
-
Integrate AI validation workflows into CI/CD and MLOps pipelines.
-
Define drift detection and performance degradation monitoring strategies.
-
Establish real-time observability dashboards for AI quality metrics.
AI Data Governance & Lifecycle Management
- Define enterprise-wide data governance strategy for AI systems, including:
Data collection and curation standards,
Annotation workflows and validation,
Dataset versioning and reproducibility,
Traceability across model iterations
- Establish gold datasets for:
Speech systems,
NLP pipelines,
Clinical and conversational workflows
-
Drive continuous learning loops between production telemetry and training data.
-
Ensure compliance with healthcare data privacy and regulatory standards.
Speech & Domain-Specific AI Validation
- Define evaluation strategies for:
Accent variability,
Noisy clinical environments,
Domain-specific vocabulary adaptation
-
Establish measurable latency and reliability benchmarks for real-time AI systems.
-
Lead failure mode analysis and systemic AI quality improvements.
Technical Leadership & Organizational Influence
-
Serve as the principal authority on AI testing and evaluation strategy.
-
Influence architecture decisions alongside Principal AI Architects and platform leaders.
-
Mentor senior engineers in AI validation, benchmarking, and data governance practices.
-
Drive AI quality maturity across multiple pods and engineering teams.
-
Partner with Product and Executive stakeholders to align AI quality metrics with business outcomes.
-
Shape long-term AI reliability roadmap for the organization.
Required Qualifications
-
Bachelor’s or Master’s degree in Computer Science, Engineering, AI, or related field.
-
13+ years of experience in software engineering, AI engineering, or AI validation roles.
-
5+ years of hands-on experience with LLM, RAG, NLP, or speech-based AI platforms.
-
Proven experience designing AI evaluation or testing frameworks at scale.
-
Strong expertise in:
Hallucination detection,
Golden dataset regression strategies,
Adversarial and edge-case testing,
Prompt validation and benchmarking
-
Strong proficiency in Python and data analysis for AI evaluation.
-
Experience building automated AI validation pipelines integrated with CI/CD.
-
Strong system design and distributed architecture understanding.
-
Experience leading cross-team technical initiatives.
Preferred / Strongly Desired Qualifications
AI & GenAI
-
Experience in architecting evaluation frameworks for production RAG systems.
-
Familiarity with semantic search validation and retrieval benchmarking.
-
Experience designing LLM guardrails and structured output validation.
-
Knowledge of Responsible AI, fairness evaluation, and compliance auditing.
Speech & Voice Systems
-
Experience evaluating ASR/TTS systems in production environments.
-
Strong understanding of speech benchmarking metrics and domain adaptation strategies.
Cloud & Platform
-
Experience with Azure ML, Azure OpenAI, Azure AI Search.
-
Familiarity with MLOps and model lifecycle automation.
-
Experience designing scalable evaluation infrastructure in cloud-native environments.
Travel Percentage: 10%
총 조회수
0
총 지원 클릭 수
0
모의 지원자 수
0
스크랩
0
비슷한 채용공고

Sr Advanced AI Engr
Honeywell · Bengaluru, Karnataka, India, IN

Senior Gen AI Architect
Wipro · Bengaluru, India

Sr. Applied Scientist, Trust CX Innovations&AI Policy
Amazon · Bengaluru, KA, IND

Machine Learning Senior Analyst
Cigna · Bengaluru, India

Engineer, Staff - Machine Learning (AISW)
Qualcomm · Bangalore, Karnataka, India
Stryker 소개

Stryker
PublicStryker Corporation is an American multinational medical technologies corporation based in Kalamazoo, Michigan.
10,001+
직원 수
Kalamazoo
본사 위치
$75B
기업 가치
리뷰
2.9
2개 리뷰
워라밸
3.0
보상
2.5
문화
2.5
커리어
2.8
경영진
2.0
35%
친구에게 추천
장점
Well-known company in medical field
Potential for career growth
Opportunity to build teams
단점
Low compensation
Poor management practices
Unfulfilled promotion promises
연봉 정보
2,009개 데이터
Senior/L5
Senior/L5 · Senior Portfolio Manager
1개 리포트
$173,157
총 연봉
기본급
$150,571
주식
-
보너스
-
$173,157
$173,157
면접 경험
4개 면접
난이도
2.8
/ 5
소요 기간
14-28주
경험
긍정 0%
보통 50%
부정 50%
면접 과정
1
Application Review
2
HR Screen
3
Hiring Manager Interview
4
Gallup Assessment
5
Field Experience/Ride Along
6
Final Presentation
자주 나오는 질문
Behavioral/STAR
Sales Experience
Culture Fit
Past Experience
Situational Judgment
뉴스 & 버즈
Stryker Corporation (NYSE:SYK) Given Consensus Rating of "Moderate Buy" by Brokerages - MarketBeat
MarketBeat
News
·
3d ago
Stryker Sports Medicine Associate
Hello, I am interested in applying for a Stryker Sports medicine associate position, and was wondering if anyone would recommend the role? I just want to make sure this lines up with my goals and desires before I go into an interview and waste someone’s time. I would love to hear some people’s experiences who worked in this specialty, and the biggest worries I have is career trajectory and work/life balance. I've worked in Logistics and in SaaS so it'll be another industry change for me.
·
4d ago
·
2
·
3
Stryker Hack Affects First Quarter Results - BankInfoSecurity
BankInfoSecurity
News
·
4d ago
Stryker leads West Michigan public companies with $25.1B in revenue - Crain's Grand Rapids
Crain's Grand Rapids
News
·
5d ago