招聘

Senior Product Operations Manager, Evaluation

Harvey AI

San Francisco

On-site

Full-time

2mo ago

薪酬

$178,000 - $210,000

必备技能

SQL

Python

Statistical analysis

Program management

Systems thinking

WHY HARVEY

At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1000+ customers in 58+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.

Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes. We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.

At Harvey, the future of professional services is being written today — and we’re just getting started.

ROLE OVERVIEW:

We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and jurisdictionally correctly is mission-critical—and evaluation complexity is increasing 10x.

As a member of our Product Operations team, you’ll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. You’ll create the workflows, systems, and tooling that make evaluation a first-class product capability at Harvey.

This is a high-ownership role for someone who thrives in ambiguity, loves building structure from ambiguity, and wants to help scale the evaluation infrastructure of a global AI company.

WHAT YOU’LL DO:

Build and scale the systems that power model and product evaluations across Harvey
Embed evaluation workflows and readiness checkpoints into the product development lifecycle
Create the single source of truth for evaluation status, results, history, and launch readiness
Turn Expert-designed evaluation methodologies into scalable, repeatable operational processes
Manage relationships with human data vendors and ensure evaluation quality meets legal standards
Work with Engineering and Research to improve evaluation tooling, automation, and dashboards
Drive evaluation readiness for major product and model launches across geographies and jurisdictions
Document and operationalize evaluation governance as complexity increases
Help define how Harvey ensures model accuracy, reliability, and trust at global scale

WHAT YOU HAVE:

4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data
Ability to work deeply with legal experts and operationalize complex evaluation methodologies
Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
High attention to detail and a bias toward clarity, rigor, and reproducibility
Ability to navigate extreme ambiguity and bring order to complex systems
Strong communication skills and comfort translating technical nuance for diverse stakeholders
Desire to do whatever it takes to make evaluation systems successful—from writing documentation to diagnosing pipeline issues

BONUS POINTS

Experience in legal tech or working with domain experts in regulated industries
Experience managing human data providers or human-in-the-loop evaluation pipelines
Background in ML research, data quality management, or evaluation science
Early employee at a hyper-growth startup
Experience at world-class product or platform operations orgs (ex: Stripe, Ramp)

COMPENSATION:

$178,000 - $210,000 USD

PLEASE FIND OUR CA APPLICANT PRIVACY NOTICE HERE https://www.harvey.ai/legal/california-applicant-privacy-notice.

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations@harvey.ai

总浏览量

申请点击数

模拟申请者数

相似职位

Marketing Operations Manager

LangChain · San Francisco, CA

Product Operations Manager

Abridge · SF Office

Sr Product Operations Manager

LendingClub · San Francisco, CA

Senior Marketing Operations Manager, B2B Sales

Brex · San Francisco, California, United States

Operations Manager, Maiden Lane San Francisco

Chanel · San Francisco, Ca

关于Harvey AI

Harvey AI

Series B

Harvey AI develops artificial intelligence software for legal professionals, providing AI-powered tools for legal research, document analysis, and workflow automation.

51-200

员工数

Boca Raton

总部位置

$1.5B

企业估值

评价

4.0

10条评价

工作生活平衡

3.8

薪酬

2.5

企业文化

4.2

职业发展

3.2

管理层

4.3

75%

推荐给朋友

优点

Supportive and collaborative team environment

Flexible work arrangements and remote options

Approachable and understanding management

缺点

Low compensation and entry-level pay

Limited career advancement opportunities

High workload and occasional long hours

薪资范围

1个数据点

Senior/L5

Senior/L5 · Software Engineer

1份报告

$243,846

年薪总额

基本工资

$187,574

股票

奖金

$243,846

面试经验

1次面试

难度

3.0

/ 5

时长

14-28周

录用率

100%

体验

正面 100%

中性 0%

负面 0%

面试流程

Application Review

Recruiter Screen

Technical Phone Screen

Onsite/Virtual Interviews

Team Matching

Offer

常见问题

Coding/Algorithm

System Design

Behavioral/STAR

Technical Knowledge

Culture Fit

新闻动态

Legal Tech Valuations Surge In 2026 Because of AI - Broadband Breakfast

Broadband Breakfast

News

4d ago

Harvey AI Upgrades Review Tables as Platform Hits 700K Daily Legal Tasks - MEXC

MEXC

News

6d ago

Torys partners with Harvey to drive firmwide AI adoption - Torys LLP

Torys LLP

News

1w ago

Legal Is Next

2w ago