採用
必須スキル
Python
SQL
Data Science
Machine Learning
Perplexity serves tens of millions of users daily with reliable, high-quality answers grounded in an LLM-first search engine and our specialized data sources. We aim to use the latest models as they are released, but the intelligence frontier is a jagged one, and popular benchmarks do not effectively cover our use cases. In this role, you will build specialized evals to improve answer quality across Perplexity, covering search-based LLM answers and other scenarios popular with our users.
RESPONSIBILITIES:
-
Architect and maintain automated evaluation pipelines to assess answer quality across Perplexity's products, ensuring high standards for accuracy and helpfulness
-
Design evaluation sets and methods specifically to measure the impact of tool calls (particularly web search retrieval) on the final answer's quality
-
Develop VLM-based solutions to programmatically evaluate how final answers render visually across different platforms and devices
-
Continuously review public benchmarks and academic evaluations for their applicability to the Perplexity product, adapting and incorporating them into our regular performance measurements
-
Operate within a small, high-impact team where your evaluation metrics directly shape product changes, collaborating closely with technical leadership to measure and improve Answer Quality
QUALIFICATIONS:
-
PhD or MS in a technical field or equivalent experience
-
4+ years of experience in data science or machine learning
-
Strong proficiency in Python and SQL (expected to write production-grade code)
-
Experience building within a modern cloud data stack, specifically AWS and Databricks
-
Comfortable with agentic coding workflows and using AI-assisted development tools to iterate faster
PREFERRED QUALIFICATIONS:
-
1+ years of experience working with LLMs at scale, specifically with LLM-as-a-judge setups
-
Prior experience working on customer-facing web products or consumer apps, with real user traffic at scale
-
A strong research background, with experience applying research methods to real-world ML problems
-
Experience defining evaluation metrics (e.g., factual consistency, hallucination rate, retrieval precision) and building ground truth datasets
総閲覧数
0
応募クリック数
0
模擬応募者数
0
スクラップ
0
類似の求人

Research Engineer, Science of Scaling
Anthropic · London, UK

Data Scientist, Subscriptions
Spotify · London

IFRS9 Modeller
Monzo · London

Digital Analytics Specialist
Accenture · London

Research Engineer, Frontier Safety Risk Assessment
Google DeepMind · London, UK; New York City, New York, US; San Francisco, California, US
Perplexity AIについて

Perplexity AI
Series BPerplexity AI, Inc., or simply Perplexity, is an American privately held software company offering a web search engine that processes user queries and synthesizes responses.
51-200
従業員数
San Francisco
本社所在地
$1B
企業価値
レビュー
3.8
10件のレビュー
ワークライフバランス
3.2
報酬
2.5
企業文化
4.0
キャリア
2.5
経営陣
2.8
65%
友人に勧める
良い点
Supportive team and management
Good work-life balance and flexibility
Cutting-edge technology and interesting projects
改善点
Low compensation compared to industry standards
Poor management and lack of leadership direction
Fast-paced and overwhelming workload
給与レンジ
26件のデータ
Senior/L5
Intern
Senior/L5 · MEMBER OF TECHNICAL STAFF AI RESEARCH ENGINEER
1件のレポート
$337,217
年収総額
基本給
$259,397
ストック
-
ボーナス
-
$337,217
$337,217
面接体験
1件の面接
難易度
4.0
/ 5
期間
14-28週間
体験
ポジティブ 0%
普通 0%
ネガティブ 100%
面接プロセス
1
Application Review
2
HR Screen
3
Take-home Marketing Challenge
4
Hiring Manager Interview
5
Panel Interview
6
Offer
よくある質問
Digital Marketing Strategy
Campaign Performance Analysis
Behavioral/STAR
Technical Marketing Knowledge
Case Study
ニュース&話題
Perplexity launches Personal Computer that brings AI agents Directly on your Mac - The Times of India
The Times of India
News
·
3d ago
"Perplexity" Unveils a Broader Vision for the Role of Artificial Intelligence in Personal Computing - وكالة صدى نيوز
وكالة صدى نيوز
News
·
4d ago
Perplexity AI Cheat Sheet: How an ‘Answer Engine’ Is Challenging Gemini, ChatGPT - eWeek
eWeek
News
·
4d ago
Perplexity priced me out of its OpenClaw clone - PCWorld
PCWorld
News
·
4d ago