
Research Engineer
Required skills
SQL
Machine Learning
ABOUT MERCOR
Mercor is at the intersection of labor markets and AI research. We partner with leading AI labs and enterprises to provide the human intelligence essential to AI development.
Our vast talent network trains frontier AI models in the same way teachers teach students: by sharing knowledge, experience, and context that can't be captured in code alone. Today, more than 30,000 experts in our network collectively earn over $2 million a day.
Mercor is creating a new category of work where expertise powers AI advancement. Achieving this requires an ambitious, fast-paced and deeply committed team. You’ll work alongside researchers, operators, and AI companies at the forefront of shaping the systems that are redefining society.
Mercor is a profitable Series C company valued at $10 billion. We work in-person five days a week in our new San Francisco headquarters.
ABOUT THE ROLE:
As a Research Engineer at Mercor, you’ll work at the intersection of engineering and applied AI research. You’ll contribute directly to post-training and RLVR, synthetic data generation, and large-scale evaluation workflows that meaningfully impact frontier language models.
Your work will be used to train large language models to master tool use, agentic behavior, and real-world reasoning in real-world production environments. You’ll shape rewards, run post-training experiments, and build scalable systems that improve model performance. You’ll help design and evaluate datasets, create scalable data augmentation pipelines, and build rubrics and evaluators that push the boundaries of what LLMs can learn.
WHAT YOU’LL DO:
-
Work on post-training and RLVR pipelines to understand how datasets, rewards, and training strategies impact model performance.
-
Design and run reward-shaping experiments and algorithmic improvements (e.g., GRPO, DAPO) to improve LLM tool-use, agentic behavior, and real-world reasoning.
-
Quantify data usability, quality, and performance uplift on key benchmarks.
-
Build and maintain data generation and augmentation pipelines that scale with training needs.
-
Create and refine rubrics, evaluators, and scoring frameworks that guide training and evaluation decisions.
-
Build and operate LLM evaluation systems, benchmarks, and metrics at scale.
-
Collaborate closely with AI researchers, applied AI teams, and experts producing training data.
-
Operate in a fast-paced, experimental research environment with rapid iteration cycles and high ownership.
WHAT WE’RE LOOKING FOR:
-
Strong applied research background, with a focus on post-training and/or model evaluation.
-
Strong coding proficiency and hands-on experience working with machine learning models.
-
Strong understanding of data structures, algorithms, backend systems, and core engineering fundamentals.
-
Familiarity with APIs, SQL/NoSQL databases, and cloud platforms.
-
Ability to reason deeply about model behavior, experimental results, and data quality.
-
Excitement to work in person in San Francisco, five days a week (with optional remote Saturdays), and thrive in a high-intensity, high-ownership environment.
NICE TO HAVE:
-
Real-world post-training team experience in industry (highest priority).
-
Publications at top-tier conferences (NeurIPS, ICML, ACL).
-
Experience training models or evaluating model performance.
-
Experience in synthetic data generation, LLM evaluations, or RL-style workflows.
-
Work samples, artifacts, or code repositories demonstrating relevant skills.
BENEFITS:
-
Generous equity grant vested over 4 years
-
A $20K relocation bonus (if moving to the Bay Area)
-
A $10K housing bonus (if you live within 0.5 miles of our office)
-
A $1K monthly stipend for meals
-
Free Equinox membership
-
Health insurance
Total Views
0
Total Apply Clicks
0
Total Mock Apply
0
Total Bookmarks
0
Similar jobs

Research Engineer, Frontier Evals & Environments - Finance
OpenAI · San Francisco

Liquid Labs - Research Engineer
Liquid AI · San Francisco

Research Engineer, Environment Scaling
Anthropic · Remote-Friendly, United States

Research Engineer
Cohere · Toronto

Research Engineer 4/5 – AI for Member Systems
Netflix · USA - Remote
About Mercor

Mercor
SeedMercor is an AI-powered platform that connects companies with vetted software engineers and technical talent through automated screening and matching processes.
1-50
Employees
San Francisco
Headquarters
Reviews
10 reviews
4.0
10 reviews
Work-life balance
3.2
Compensation
3.8
Culture
4.3
Career
3.5
Management
4.2
72%
Recommend to a friend
Pros
Supportive and approachable management
Great team culture and collaborative environment
Good benefits and flexible work options
Cons
Heavy workload and frequent overtime
Communication issues and miscommunication
Non-competitive pay and limited career progression
Salary Ranges
6 data points
Mid/L4
Mid/L4 · Machine Learning Engineer
1 reports
$210,126
total per year
Base
$161,637
Stock
-
Bonus
-
$210,126
$210,126
Interview experience
3 interviews
Difficulty
3.0
/ 5
Offer rate
67%
Experience
Positive 0%
Neutral 67%
Negative 33%
Interview process
1
Application Review
2
AI Interview Screen
3
Technical Assessment
4
Final Review
5
Offer
Common questions
Domain Expertise
Behavioral/STAR
Industry Knowledge
Leadership Experience
Problem Solving
Latest updates
Mercor competitor Deccan AI raises $25M, sources experts from India - MSN
MSN
News
·
1w ago
News - Mercor faces class-action lawsuits after data breach exposes contractor information - teiss
teiss
News
·
1w ago
AI Startup Mercor Faces Lawsuit Over Data Breach - PYMNTS.com
PYMNTS.com
News
·
2w ago
AI recruiting startup Mercor hit with at least seven class-action lawsuits after hacking: What the compan - The Times of India
The Times of India
News
·
2w ago