Microsoft

Empowering every person and organization on the planet to achieve more.

Principal Technical Program Manager - Applied Science

Job function: Product

Experience: Staff+

Location: United States, Washington, Redmond

Work arrangement: On-site

Employment type: Full-time

Posted: 2 weeks ago

Overview

AI evaluations are at a critical inflection point. Static benchmarks are saturated: MMLU, HumanEval, and SWE-bench have reached their limits as models become increasingly familiar with public test data and capable of autonomously finding answer keys online. The gap between benchmark scores and real developer experience is growing, making it hard to understand which problems are truly ‘solved’ and which are worth deeper investment.

GitHub is uniquely positioned to lead the industry through this transition. We have direct feedback and deep insight into real production workflows from millions of developers, and the scale to build evaluation systems that truly reflect developer success. We’re looking for a Principal Technical Program Manager to help us build the future of AI evaluation.

The Applied Science team for GitHub Copilot sits at the intersection of frontier AI research and the world's largest developer platform. We ship AI-powered experiences (e.g., code completion, code review, coding agents) used by millions of professional developers every day. As a member of the team, you will help lead GitHub Copilot's AI evaluation strategy end-to-end: from benchmark design and lifecycle governance, through evaluation infrastructure and internal adoption, to community engagement and public transparency. You are the person who ensures that every model swap, product harness, and feature launch is measured against what actually matters to developers, and that the world can see the results.

Responsibilities

In this role you’ll:

Partner with Applied Science researchers to translate cutting-edge evaluation research into production systems: adaptive testing (IRT), agent-centric co-evolution, adversarial benchmarking, and telemetry-driven benchmark generation.
Lead the deprecation of saturated benchmarks and design their next-generation replacements — including procedurally-generated code evaluations that can't be memorized and adaptive testing systems that skip trivial questions for frontier models.
Build GitHub's community benchmark submission program — enabling external researchers, enterprises, and open-source developers to contribute domain-specific evaluations — and publish GitHub's first external benchmark transparency reports showing how models perform on real developer workflows.
Design and operationalize multi-tier evaluation frameworks — from fast automated regression suites and LLM-as-judge systems, through expert human evaluation, to production A/B testing — so teams can iterate in hours, not weeks.
Design feedback-to-benchmark pipelines that convert thumbs-down signals, user frustrations, and support tickets into candidate regression tests — systematizing informal practices into scalable, automated systems.
Establish evaluation as a first-class discipline across GitHub Copilot — creating the rituals, dashboards, and communication cadences that make evaluation results accessible and actionable for every team.

Qualifications

Required Qualifications:

Bachelor's Degree AND 6+ years of experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience.
3+ years of experience managing cross-functional and/or cross-team projects.

Other Requirements

Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings:

Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years

Preferred Qualifications:

5+ years of experience in technical program management, product management, applied science, or equivalent.
2+ years managing programs in machine learning, AI/ML evaluation, or data science.
2+ years managing cross-functional and/or cross-team projects.
Deep, firsthand experience with AI/ML evaluation methodologies: benchmark design and validity, human evaluation frameworks, automated scoring systems (including LLM-as-judge), A/B testing, and statistical significance.
Deep personal experience with AI coding tools — you use Copilot, Cursor, Claude Code, or similar tools daily and have strong opinions about what "good" looks like from a developer's perspective.
Understanding of software engineering workflows at scale — code review, CI/CD, testing, debugging, refactoring — and how AI tools should integrate into each.
Experience with community or open-source program management — contributor programs, external research partnerships, or developer relations in a technical context.
Proven ability to navigate competing priorities across teams and build shared commitment to common goals in ambiguous, fast-moving environments.
Track record of building evaluation systems that directly influenced product or model shipping decisions at scale.

Technical Program Management IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
