Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent-dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.
About the Role
You will lead the Model Routing & Inference team at Cursor, owning the inference platform that powers every AI interaction in the product. This team owns the full inference path: making Cursor's AI faster, more reliable, and more cost-effective at a scale few teams in the world get to operate at. Every agent session, every tab completion, and every chat message flows through your stack.
You'll set technical direction for cluster management, inference optimization, and traffic egress, building the platform that lets the rest of the company move fast without worrying about provider complexity. You'll lead a team of strong engineers, set clear direction for the business, and make the calls that balance latency, cost, reliability, and user experience across millions of daily requests.
What you’ll do
- Building and evolving our inference gateway, a single abstraction over every provider's API semantics, so model onboarding becomes a config change.
- Building the systems that dynamically select the best model for each request based on cost, latency, and quality.
- Managing GPU cluster utilization and capacity planning across providers, optimizing for cost and performance.
- Designing routing backpressure and admission control so traffic spikes don't cascade into providers.
- Hiring and growing the team: sourcing, interviewing, and closing top inference and systems talent, while developing your engineers through coaching, mentorship, and high-leverage project assignments.
You may be a fit if
- You have led engineering teams building high-throughput, low-latency distributed systems, especially in inference serving, traffic routing, or real-time data pipelines.
- You're comfortable reasoning about cost/performance tradeoffs at scale (GPU utilization, provider economics, capacity planning) and making decisions with incomplete information.
- You have strong software engineering fundamentals and enjoy shipping production systems that handle millions of requests.
- Experience with model serving frameworks (vLLM, TensorRT-LLM, TGI), load balancing, or building resilient multi-provider architectures is a plus.
- You make good calls in the gray area: weighing reliability, cost, latency, and user experience when there isn't a single "right" answer.
About Anysphere (Cursor)

Anysphere (Cursor)
Series B
Anysphere develops Cursor, an AI-powered code editor that integrates language models to assist developers with code completion, generation, and editing. The company focuses on enhancing developer productivity through artificial intelligence tools.
Employees: 51-200
Headquarters: San Francisco
Valuation: $400M
Reviews
4.0 (10 reviews)
Work-life balance: 3.8
Compensation: 2.7
Culture: 4.2
Career: 3.0
Management: 4.0
72% recommend to a friend
Pros
Supportive and approachable management
Great team culture and collaborative environment
Flexible hours and work-life balance
Cons
Below market compensation and pay
Heavy workload and demanding projects
Limited growth and career opportunities
Salary Ranges
7 data points
Senior/L5
Senior/L5 · Product Designer (1 report)
Total per year: $325,000
Base: $250,000
Stock: -
Bonus: -
Interview experience
2 interviews
Difficulty: 3.0 / 5
Experience: Positive 0%, Neutral 0%, Negative 100%
Interview process
1. Application Review
2. Technical Interview
3. Work Trial/Take-home Assignment
4. Final Review
5. Offer Decision
Common questions
Coding/Algorithm
Technical Knowledge
Past Experience
Problem Solving