채용

Model Behavior Architect

Perplexity AI

San Francisco

On-site

Full-time

1mo ago

필수 스킬

Python

LLMs

Prompt Engineering

Evaluation Design

ABOUT THE ROLE:

We're looking for a Model Behavior Architect to help build Perplexity's AI products and evaluations. You'll sit within our AI team and collaborate closely with research and product teams, designing prompt and context engineering strategies to deliver high quality user experiences across multiple domains and models.

This role is equal parts craft and science. You'll develop a deep understanding of our answer engine by pressure-testing model capabilities and working across our AI infrastructure (including system and tool prompts, skills, and evaluations) to create a stellar product experience for our users.

You'll serve as a go-to expert on prompting, model quality, and behavioral consistency across new product features and model releases.

KEY RESPONSIBILITIES:

Context Engineering: Design, test, and optimize context strategies and system prompts that shape answer engine behavior across products, features, and use cases.
Evaluation Systems: Build automated and semi-automated evaluation pipelines that measure model quality, catch regressions, and scale across product surfaces.
Model Launch Support: Partner with research and engineering to validate model behavior before and during rollouts, ensuring smooth transitions with no degradation.
Research & Analysis: Identify inconsistencies and failure modes in model outputs through well-designed research projects — for both internal and production-facing systems.
Cross-functional Collaboration: Work closely with design, product, and research teams to translate product goals into concrete model behavior requirements.
Knowledge Sharing: Help engineers across teams build intuition for prompt design, context engineering, and evaluation best practices.
Staying Current: Track the latest alignment, evaluation, and prompting techniques from industry and academia, and bring the best ideas back to the team.

WHAT WE'RE LOOKING FOR:

REQUIRED

Experience designing evaluations, benchmarks, or metrics for AI systems.
Strong written and verbal communication skills, particularly in explaining complex concepts to diverse stakeholders.
Ability to manage multiple concurrent projects in a fast-moving environment.
Strong experience with Perplexity or other frontier AI models in production settings.
Demonstrated experience with Python — you'll prototype, debug, automate, and build systems at scale.
3+ years of experience working with LLMs in a product or research setting.

PREFERRED

Experience with A/B testing or experimentation frameworks.
Track record of improving AI system performance through systematic evaluation and iteration.

THIS ROLE MAY BE A GREAT FIT FOR YOU IF YOU:

Get excited about edge cases in model behavior and love digging into how an answer could be better.
Enjoy turning qualitative "this feels off" intuitions into quantitative metrics and systematic fixes.
Want to work at the intersection of research and product, where your work ships to real users same-day.
Are comfortable with ambiguity and can define what "good" looks like for novel AI features.
Have a hacker spirit — you'd rather build a quick prototype to test a hypothesis than debate it in a doc.
Care deeply about making AI more reliable and useful for our users.

총 조회수

총 지원 클릭 수

모의 지원자 수

비슷한 채용공고

Machine Learning Engineer

Together AI · San Francisco

ML Infra Engineer - Supercomputing

Physical Intelligence · San Francisco

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

Scale AI · San Francisco, CA; New York, NY

Machine Learning Engineer, Marketplace

Mercor · San Francisco

Machine Learning Engineer II

Uber · San Francisco, CA; Seattle, WA; Sunnyvale, CA

Perplexity AI 소개

Perplexity AI

Series B

Perplexity AI, Inc., or simply Perplexity, is an American privately held software company offering a web search engine that processes user queries and synthesizes responses.

51-200

직원 수

San Francisco

본사 위치

$1B

기업 가치

리뷰

3.8

10개 리뷰

워라밸

3.2

보상

2.5

문화

4.0

커리어

2.5

경영진

2.8

65%

친구에게 추천

장점

Supportive team and management

Good work-life balance and flexibility

Cutting-edge technology and interesting projects

단점

Low compensation compared to industry standards

Poor management and lack of leadership direction

Fast-paced and overwhelming workload

연봉 정보

26개 데이터

Senior/L5

Intern

Senior/L5 · MEMBER OF TECHNICAL STAFF AI RESEARCH ENGINEER

1개 리포트

$337,217

총 연봉

기본급

$259,397

주식

보너스

$337,217

면접 경험

1개 면접

난이도

4.0

/ 5

소요 기간

14-28주

경험

긍정 0%

보통 0%

부정 100%

면접 과정

Application Review

HR Screen

Take-home Marketing Challenge

Hiring Manager Interview

Panel Interview

Offer

자주 나오는 질문

Digital Marketing Strategy

Campaign Performance Analysis

Behavioral/STAR

Technical Marketing Knowledge

Case Study

뉴스 & 버즈

Perplexity launches Personal Computer that brings AI agents Directly on your Mac - The Times of India

The Times of India

News

3d ago

"Perplexity" Unveils a Broader Vision for the Role of Artificial Intelligence in Personal Computing - وكالة صدى نيوز

وكالة صدى نيوز

News

3d ago

Perplexity AI Cheat Sheet: How an ‘Answer Engine’ Is Challenging Gemini, ChatGPT - eWeek

eWeek

News

4d ago

Perplexity priced me out of its OpenClaw clone - PCWorld

PCWorld

News

4d ago