Jobs
Required skills
Python
SQL
Data Science
Machine Learning
Perplexity serves tens of millions of users daily with reliable, high-quality answers grounded in an LLM-first search engine and our specialized data sources. We aim to use the latest models as they are released, but the intelligence frontier is a jagged one, and popular benchmarks do not effectively cover our use cases. In this role, you will build specialized evals to improve answer quality across Perplexity, covering search-based LLM answers and other scenarios popular with our users.
RESPONSIBILITIES:
- Architect and maintain automated evaluation pipelines to assess answer quality across Perplexity's products, ensuring high standards for accuracy and helpfulness
- Design evaluation sets and methods specifically to measure the impact of tool calls (particularly web search retrieval) on the final answer's quality
- Develop VLM-based solutions to programmatically evaluate how final answers render visually across different platforms and devices
- Continuously review public benchmarks and academic evaluations for their applicability to the Perplexity product, adapting and incorporating them into our regular performance measurements
- Operate within a small, high-impact team where your evaluation metrics directly shape product changes, collaborating closely with technical leadership to measure and improve Answer Quality
QUALIFICATIONS:
- PhD or MS in a technical field or equivalent experience
- 4+ years of experience in data science or machine learning
- Strong proficiency in Python and SQL (expected to write production-grade code)
- Experience building within a modern cloud data stack, specifically AWS and Databricks
- Comfortable with agentic coding workflows and using AI-assisted development tools to iterate faster
PREFERRED QUALIFICATIONS:
- 1+ years of experience working with LLMs at scale, specifically with LLM-as-a-judge setups
- Prior experience working on customer-facing web products or consumer apps, with real user traffic at scale
- A strong research background, with experience applying research methods to real-world ML problems
- Experience defining evaluation metrics (e.g., factual consistency, hallucination rate, retrieval precision) and building ground truth datasets
About Perplexity AI

Perplexity AI
Series B
Perplexity AI, Inc., or simply Perplexity, is an American privately held software company offering a web search engine that processes user queries and synthesizes responses.
Employees: 51-200
Headquarters: San Francisco
Valuation: $1B
Reviews
Overall rating: 3.8 (10 reviews)
Work-life balance: 3.2
Compensation: 2.5
Culture: 4.0
Career: 2.5
Management: 2.8
Recommend to a friend: 65%
Pros
- Supportive team and management
- Good work-life balance and flexibility
- Cutting-edge technology and interesting projects
Cons
- Low compensation compared to industry standards
- Poor management and lack of leadership direction
- Fast-paced and overwhelming workload
Salary information
26 data points
Levels: Senior/L5, Intern
Senior/L5 · MEMBER OF TECHNICAL STAFF AI RESEARCH ENGINEER (1 report)
Total compensation: $337,217
Base salary: $259,397
Stock: -
Bonus: -
Interview experience
1 interview
Difficulty: 4.0 / 5
Duration: 14-28 weeks
Experience: 0% positive, 0% neutral, 100% negative
Interview process
1. Application Review
2. HR Screen
3. Take-home Marketing Challenge
4. Hiring Manager Interview
5. Panel Interview
6. Offer
Frequently asked questions
- Digital Marketing Strategy
- Campaign Performance Analysis
- Behavioral/STAR
- Technical Marketing Knowledge
- Case Study
News & Buzz
- Perplexity launches Personal Computer that brings AI agents Directly on your Mac (The Times of India, 3d ago)
- "Perplexity" Unveils a Broader Vision for the Role of Artificial Intelligence in Personal Computing (وكالة صدى نيوز, 3d ago)
- Perplexity AI Cheat Sheet: How an ‘Answer Engine’ Is Challenging Gemini, ChatGPT (eWeek, 4d ago)
- Perplexity priced me out of its OpenClaw clone (PCWorld, 4d ago)



