Hiring
Staff Engineer — API Core Platform
About the role
Together AI is seeking an experienced Backend Engineer to found Together’s API Platform team within the Production Foundations organization. In this role, you will define, build, and scale the core systems and architecture that power Together’s mission-critical APIs, including the public APIs that customers use directly and through SDKs and CLIs, as well as the client APIs powering Together’s Cloud UI.
In the near term, you will improve and standardize the backend API layer within our primary Next.js monolith, raising the bar on reliability, performance, and consistency. In parallel, you will design and lead the evolution toward scalable, purpose-built next-gen API platform solutions optimized for different Public API and Client API use cases and traffic patterns — defining the long-term architecture and driving its incremental rollout.
This is a deeply hands-on role for an engineer who thrives on writing critical-path code and building platforms that unify engineering efforts across teams. You will work across backend systems, infrastructure layers, identity and access flows, and developer tooling to establish a cohesive API strategy that supports Together’s rapidly growing AI Cloud.
Responsibilities
- Design and drive the evolution of Together’s API platform, defining how APIs are built, versioned, secured, tested, and operated across the company
- Own and improve the backend API layer within our primary Next.js monolith, raising the bar on consistency, reliability, and performance
- Architect and lead the transition toward scalable, purpose-built API platforms optimized for different traffic patterns and product surfaces
- Write and maintain critical-path platform code that multiple services and product teams depend on
- Design and implement robust authentication, authorization, and identity-aware access patterns across public and internal APIs
- Establish performance standards for high-throughput APIs, implementing caching, rate limiting, fan-out control, and graceful degradation strategies
- Raise the bar on API observability and reliability, defining SLOs, monitoring, alerting, and incident response practices
- Drive API data modeling and schema generation strategies to ensure long-term maintainability and developer ergonomics
- Partner with infrastructure and security teams to maintain a strong security posture and evolve toward zero-trust architectures
- Mentor engineers, influence architectural direction across teams, and help define hiring standards as the API Platform grows
Required Qualifications
- 8+ years of experience building and operating large-scale, distributed backend systems in production environments
- Proven experience building or significantly evolving an API platform used by multiple teams or customer-facing products
- Expert-level proficiency in one or more of Golang, TypeScript, C++, or Java
- Deep expertise in API performance and scalability, including caching strategies, rate limiting, parallelization, fan-out control, and graceful degradation
- Strong experience designing and implementing production-grade authentication and authorization systems for customer-facing APIs
- Demonstrated ability to drive cross-team architectural initiatives without formal authority, aligning multiple stakeholders around long-term platform direction
- Experience building and operating systems using Infrastructure as Code (Terraform, AWS CDK, Pulumi) and modern CI/CD workflows
- Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or equivalent practical experience
Nice to Have
- Experience with GraphQL or schema-based API federation systems
- Experience evolving APIs from monoliths to modular, platform-oriented architectures
- Experience designing and operating API schema generation and validation systems
- Experience building developer-facing SDKs or command-line tools
- Experience designing and operating multi-region, globally distributed API systems
- Experience designing horizontally scalable API systems capable of handling high request volume and burst traffic patterns
- Experience running production workloads in Kubernetes-based environments
- Experience building services in zero-trust or identity-aware architectures
- Experience with AWS networking, traffic management, and load balancing
- Experience with Cloudflare or CDN-level API performance optimization
About Together AI
Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers on our journey to build the next generation of AI infrastructure.
Compensation
We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $240,000 - $275,000, plus equity and benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge.
Equal Opportunity
Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.
Please see our privacy policy at https://www.together.ai/privacy
Company Profile
Together AI
Series B. Data annotation company.
Employees: 51-200
Headquarters: San Francisco
Valuation: $1.25B
Reviews
Overall: 4.0 (19 reviews)
Work Life Balance: 3.5
Compensation: 4.5
Culture: 4.0
Career: 4.3
Management: 3.5
Recommend to a Friend: 79%
Pros
- Cutting-edge technology stack and interesting technical challenges
- Competitive compensation packages with equity
- Strong engineering culture with focus on code quality
Cons
- Organizational changes and restructuring can be disruptive
- Internal politics in some teams
- Work-life balance can be challenging during product launches
Interview Experience
3 interviews
Difficulty: 3.3 / 5
Duration: 14-28 weeks
Interview Process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Coding Round
5. System Design Round
6. Onsite/Virtual Interviews
Common Questions
- Coding/Algorithm
- System Design
- Technical Knowledge
- Infrastructure/SRE
- MLOps/Machine Learning