
Video understanding AI platform
Senior Backend Software Engineer, Rodeo
Required skills
AWS
GCP
WHO WE ARE:
At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.
With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.
We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.
ABOUT THE ROLE:
As a Senior Backend Software Engineer at Twelve Labs, you’ll build the server-side infrastructure powering our new agentic application layer. You'll join a small, high-impact team and own the critical transition from prototype to production-ready platform.
Up to 25% travel to Los Angeles is expected.
Candidates must be able to travel up to 10% of the time annually to attend conferences, off-site meetings, and other business-related events as required by the role. This role may require participation in on-site interviews and/or completion of in-person onboarding processes.
IN THIS ROLE, YOU WILL:
BACKEND
- Design and build backend services for video processing workflows: ingestion, transcoding, 4K export, metadata extraction, and timeline operations
- Architect scalable, high-availability systems to support enterprise-grade video workloads across cloud-native infrastructure (AWS, GCP)
- Build and optimize APIs that power real-time and async frontend workflows, including streaming data delivery and long-running job orchestration
- Own performance and reliability for distributed video processing pipelines with low latency and high throughput requirements
- Collaborate closely with frontend engineers on API design, data models, and streaming strategies
ML INTEGRATION
- Integrate and run inference on computer vision models for tasks like video resizing, scene detection, automatic audio noise cleaning, and visual analysis
- Deploy and serve ML models on cloud-based or cloud-native platforms, and evaluate build-vs-buy for model serving and SaaS alternatives
- Work with the research team to productionize model outputs into reliable, scalable backend services
- Build pipelines that bridge Twelve Labs’ foundation models with third-party CV models to power intelligent video workflows
YOU MAY BE A GOOD FIT IF YOU HAVE:
- 7+ years building production backend systems with a track record of designing scalable web services and APIs
- Experience with video-specific tools and frameworks (FFmpeg, AWS Media Services, transcoding pipelines)
- Deep experience with service-oriented architecture, microservices, and distributed systems
- Strong proficiency in Python for backend services, model integration, and tooling
- Hands-on experience running inference on ML/CV models in production: not research, but engineering models into reliable services
- Cloud-native development experience (AWS or GCP), including containerization (Docker, Kubernetes) and serverless patterns
- Comfort working across the stack and making pragmatic tradeoffs in a fast-moving product environment
PREFERRED QUALIFICATIONS:
- Advanced API design skills (RESTful, streaming, async patterns)
- Familiarity with model serving platforms (TorchServe, Triton, SageMaker endpoints, or similar)
- Experience with MLOps practices: model deployment, monitoring, versioning
- Background in media, entertainment, or video streaming platforms
- Exposure to CI/CD pipelines and observability tools (Prometheus, Grafana) for production systems
- Experience with AI-powered product features or agentic application architectures
BENEFITS AND PERKS:
🤝 An open and inclusive culture and work environment.
🚀 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.
🏥 Full health, dental, and vision benefits.
✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Year's.
🛂 Visa support where applicable.
About Twelve Labs

Twelve Labs
Funding stage: Series A
Employees: 51-200
Headquarters: San Francisco
Reviews (10 reviews)
Overall rating: 3.8
Work-life balance: 4.2
Compensation: 2.8
Culture: 4.0
Career: 3.2
Management: 3.5
Would recommend to a friend: 65%
Pros
Great work-life balance
Supportive team and environment
Good company culture and friendly coworkers
Cons
Compensation/pay not competitive
Limited career advancement opportunities
Poor management and lack of direction
Salary information (5 data points)
Levels: Senior/L5, Intern
Senior/L5 · Machine Learning Engineer (1 report)
Total compensation: $318,500
Base salary: $245,000
Stock: -
Bonus: -