Jobs
Required Skills
SQL
Python
Statistical analysis
Program management
Systems thinking
WHY HARVEY
At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end-to-end. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.
This is a rare chance to help build a generational company at a true inflection point. With 1000+ customers in 58+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.
Our team is sharp, motivated, and deeply committed to the mission. We move fast, operate with intensity, and take real ownership of the problems we tackle — from early thinking to long-term outcomes. We stay close to our customers — from leadership to engineers — and work together to solve real problems with urgency and care. If you thrive in ambiguity, push for excellence, and want to help shape the future of work alongside others who raise the bar, we invite you to build with us.
At Harvey, the future of professional services is being written today — and we’re just getting started.
ROLE OVERVIEW:
We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and jurisdictionally correctly is mission-critical—and evaluation complexity is increasing 10x.
As a member of our Product Operations team, you’ll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. You’ll create the workflows, systems, and tooling that make evaluation a first-class product capability at Harvey.
This is a high-ownership role for someone who thrives in ambiguity, loves building structure from ambiguity, and wants to help scale the evaluation infrastructure of a global AI company.
WHAT YOU’LL DO:
-
Build and scale the systems that power model and product evaluations across Harvey
-
Embed evaluation workflows and readiness checkpoints into the product development lifecycle
-
Create the single source of truth for evaluation status, results, history, and launch readiness
-
Turn Expert-designed evaluation methodologies into scalable, repeatable operational processes
-
Manage relationships with human data vendors and ensure evaluation quality meets legal standards
-
Work with Engineering and Research to improve evaluation tooling, automation, and dashboards
-
Drive evaluation readiness for major product and model launches across geographies and jurisdictions
-
Document and operationalize evaluation governance as complexity increases
-
Help define how Harvey ensures model accuracy, reliability, and trust at global scale
WHAT YOU HAVE:
-
4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
-
Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
-
Comfort with statistical methodologies and SQL or Python, or similar tools to interpret evaluation data
-
Ability to work deeply with legal experts and operationalize complex evaluation methodologies
-
Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
-
High attention to detail and a bias toward clarity, rigor, and reproducibility
-
Ability to navigate extreme ambiguity and bring order to complex systems
-
Strong communication skills and comfort translating technical nuance for diverse stakeholders
-
Desire to do whatever it takes to make evaluation systems successful—from writing documentation to diagnosing pipeline issues
BONUS POINTS
-
Experience in legal tech or working with domain experts in regulated industries
-
Experience managing human data providers or human-in-the-loop evaluation pipelines
-
Background in ML research, data quality management, or evaluation science
-
Early employee at a hyper-growth startup
-
Experience at world-class product or platform operations orgs (ex: Stripe, Ramp)
COMPENSATION:
$178,000 - $210,000 USD
PLEASE FIND OUR CA APPLICANT PRIVACY NOTICE HERE https://www.harvey.ai/legal/california-applicant-privacy-notice.
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations@harvey.ai
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Manager, Financial Planning & Analysis
Salesforce · 2 Locations
District Manager - South Central Georgia Macon, GA 4501 Log Cabin Drive, Macon, GA, USA, 31204 3003 Watson Blvd, Warner Robins, GA, USA, 31093 1805 US Highway 82 W, Tifton, GA, USA, 31793 Job Category | District Manager Position Type | Full-Time
Aldi · macon

Director, EMEA Customer Briefing Program
Adobe · London

Channel Manager, Microbiology
Thermo Fisher · Vietnam, Vietnam

Risk and Control Operations Manager
Citigroup · Chennai, India
About Harvey AI

Harvey AI
Series BHarvey AI develops artificial intelligence software for legal professionals, providing AI-powered tools for legal research, document analysis, and workflow automation.
51-200
Employees
Boca Raton
Headquarters
$1.5B
Valuation
Reviews
3.8
1 reviews
Work Life Balance
3.0
Compensation
3.0
Culture
3.0
Career
2.5
Management
3.0
45%
Recommend to a Friend
Pros
AI results improve significantly after optimization
Tool effectively handles RFP review tasks
Capable platform with proper tuning
Cons
Initial outputs are poor quality
May reduce need for legal review positions
Limited career growth opportunities
Salary Ranges
1 data points
Senior/L5
Senior/L5 · Software Engineer
1 reports
$243,846
total / year
Base
$187,574
Stock
-
Bonus
-
$243,846
$243,846
Interview Experience
2 interviews
Difficulty
3.0
/ 5
Duration
14-28 weeks
Interview Process
1
Application Review
2
Recruiter Screen
3
Pre-onsite Assessment
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
News & Buzz
Harvey Showcases Law Firm Adoption as AI Tool Gains Traction in Legal Workflows - TipRanks
Source: TipRanks
News
·
5w ago
Harvey AI Snaps Up Hexus in First Major Acquisition Move - MLQ.ai
Source: MLQ.ai
News
·
5w ago
HSBC rolls out legal AI platform with Harvey - Financial News London
Source: Financial News London
News
·
6w ago
HSBC selects Harvey as its legal AI platform - Legal IT Insider
Source: Legal IT Insider
News
·
6w ago