Technical Program Manager, Human Evaluation Operations
United States, California, Mountain View; United States, Washington, Redmond
On-site · Full-time · 1mo ago
Compensation
$100,600 - $199,000
Required Skills
Technical program management
Cross-functional project management
Operations management
Overview:
Microsoft AI (MAI) is building the world’s most advanced AI systems—and rigorous, scalable human evaluation is foundational to ensuring our models are safe, aligned, and high‑quality. The Human Evaluation Operations (Human Eval Ops) team powers this by running one of the largest and most reliable human‑in‑the‑loop pipelines at Microsoft.
We are hiring two Technical Program Managers to join this team and own end‑to‑end evaluation operations for model quality, safety, and capability development. These TPMs will partner closely with product squads, engineering, data scientists, researchers, and external annotation vendors to deliver high‑quality human evaluations at scale.
You will drive programs that ensure MAI has the people, processes, training pipelines, and tooling needed to enable fast, trustworthy, and efficient evaluation across a wide range of AI tasks.
This is a highly cross‑functional, execution‑oriented TPM role ideal for someone who thrives in operational complexity, is deeply organized, and loves working at the intersection of people, process, and product quality.
Responsibilities:
- Lead Human Evaluation Programs: Drive end‑to‑end human evaluation workflows supporting model quality, safety, and capability initiatives across MAI. Coordinate evaluation planning, task design alignment, and delivery with product squads, engineering, and research partners.
- Manage Evaluation Workforce & Readiness: Oversee the health, performance, and scalability of MAI’s human evaluation workforce—including onboarding, qualification, training, and continuous performance management—to ensure reliable, high‑quality evaluation signals.
- Operational Excellence & Quality Governance: Maintain high operational standards across human‑in‑the‑loop pipelines by monitoring quality signals, resolving issues, and guiding teams toward consistent, trustworthy evaluation outcomes.
- Cross‑Functional Program Leadership: Partner with product squads to scope evaluation needs, define instructions and scorecards, support experimentation, and ensure teams are equipped to use human evaluations effectively.
- Platform & Vendor Partnership Management: Represent MAI needs to platform and vendor partners, shaping their roadmaps and ensuring capacity, reliability, and compliance with MAI standards.
- Insights, Tooling, & Documentation: Provide evaluation insights to product teams, maintain essential documentation, and influence the evolution of internal tools, dashboards, and processes that enable scalable human evaluations.
- Specialized Evaluation Programs: Support domain‑specific or advanced evaluation initiatives (e.g., expert reviews, structured scoring programs) in collaboration with MAI stakeholders.
Qualifications
Required Qualifications:
- Bachelor's Degree AND 2+ years of experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience.
- 1+ year(s) of experience managing cross-functional and/or cross-team projects.
Preferred Qualifications:
- 3+ years of technical program management, operations management, data operations, or equivalent experience.
- 1+ year(s) of experience reading and/or writing code (e.g., sample documentation, product demos).
- Experience working cross‑functionally with engineering, research, PM, vendors, and operations partners.
- Experience managing vendor relations or external workforce programs.
- Strong analytical skills and comfort working with dashboards, metrics, and evaluation data.
- Experience running human‑in‑the‑loop data pipelines (e.g., annotations, RLHF, safety evals, quality assurance, crowdsourcing).
- Familiarity with LLM and AI model evaluation practices and with data annotation platforms and systems.
- Ability to quickly understand product quality signals, debug task design issues, and iterate with engineering teams.
- Experience in operational excellence, process automation, or scaling manual workflows.
Technical Program Management IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. A different range applies to specific work locations within the San Francisco Bay area and New York City metropolitan area, where the base pay range for this role is USD $131,400 - $215,400 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
About Microsoft
Reviews
3.8 (5 reviews)
Work Life Balance: 4.1
Compensation: 4.3
Culture: 3.4
Career: 3.2
Management: 3.0
Recommend to a Friend: 65%
Pros
Excellent compensation and benefits package
Four-day workweek with improved work-life balance
Supportive managers and teams
Cons
High-pressure environment causing anxiety
Unprofessional interview processes
Limited creative work opportunities
Salary Ranges
5,571 data points
Junior/L3
Mid/L4
Junior/L3 · Advertising Client Success (2 reports)
Total: $163,358 / year
Base: $141,875
Stock: -
Bonus: -
Interview Experience
7 interviews
Difficulty: 3.7 / 5
Duration: 14-28 weeks
Offer Rate: 14%
Experience: Positive 14%, Neutral 29%, Negative 57%
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Technical Interview
5. Onsite/Virtual Interviews
6. Final Round
7. Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience