Twelve Labs

Video understanding AI platform

Research Scientist, Public Sector

职能机器学习

级别中级

地点Remote US

方式远程

类型全职

发布1个月前

立即申请

WHO WE ARE:

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a remarkable $107 million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

ABOUT THE ROLE:

As a Research Scientist on the Public Sector team, you play a major role in bringing Twelve Labs' video AI capabilities - including our multimodal foundation models - to mission-critical government applications. This role focuses on applying our video intelligence technology to classified and government-specific use cases, including model training, finetuning and evaluation grounded in operational requirements.

You will be the dedicated research scientist for the Public Sector team, bridging Twelve Labs' cutting-edge multimodal AI research and the unique requirements of U.S. federal, defense, and intelligence community customers. This is an opportunity to have direct operational impact and define major components of this applied science practice from the ground up.

IN THIS ROLE, YOU WILL:

Adapt Twelve Labs' video understanding and multimodal models for government-specific use cases (defense, intelligence analysis, federal records management)
Run training and fine-tuning experiments on cutting edge GPU infrastructure
Develop supervised fine-tuning pipelines tailored to government-specific datasets and annotation workflows
Design rigorous evaluation frameworks, including domain-specific benchmarks and operational performance metrics, that are tailored to public sector requirements
Work closely with Solutions Engineering and the engineering team to translate customer requirements into technical implementations

YOU MAY BE A GOOD FIT IF YOU HAVE:

Strong research experience in one or more of: deep learning, computer vision, multimodal representation learning, temporal video understanding, or neural networks.
Hands-on experience with model fine-tuning, supervised fine-tuning (SFT), or domain adaptation at scale
Experience leading or contributing to research projects for government, DoD, or Intelligence Community programs - including applied research, model development, or technical delivery in mission-driven environments
Proficiency in Python and Py Torch
Comfort working within constrained or regulated compute environments
Active Top Secret clearance or ability to obtain
PhD or Master's in Computer Science, Mathematics, or related field

STRONG CANDIDATES MAY ALSO HAVE:

Active TS/SCI clearance
Experience leading or contributing to research projects for government, DoD, or Intelligence Community programs - including applied research, model development, or technical delivery in mission-driven environments
Experience with government deployment requirements (FedRAMP, FIPS, air-gapped networks)
Background in video understanding or video-language models
Publications in top conferences (CVPR, NeurIPS, etc.)

Candidates must be able to travel up to 10% of the time annually to attend conferences, off-site meetings, and other business-related events as required by the role. This role may require participation in on-site interviews and/or completion of in-person onboarding processes.

BENEFITS AND PERKS:

🤝 An open and inclusive culture and work environment.

🚀 Work closely with a collaborative, mission-driven team on cutting-edge AI technology.

🏥 Full health, dental, and vision benefits

✈️ Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Years.

🛂 VISA support where applicable

浏览量

申请点击

Mock Apply

相似职位

Applied AI Scientist - Multimodal Intelligence

Apple · Seattle, WA

Computer Vision Architect

Infosys · Dallas, TX

Machine Learning Engineer

Apple · Seattle, WA

SOLUTION ARCHITECT - MACHINE LEARNING & GEN AI

Wipro · Bengaluru, India

Software Engineer, BigQuery AI/ML

Google

关于Twelve Labs

Twelve Labs

Series A

Intel Capital Corporation started off as the investment arm of Intel Corporation in 1991 and in January 2025, it spun off as a standalone investment fund.

51-200

员工数

San Francisco

总部位置

评价

10条评价

3.8

10条评价

工作生活平衡

4.2

薪酬

2.8

企业文化

4.0

职业发展

3.2

管理层

3.5

65%

推荐率

优点

Great work-life balance

Supportive team and environment

Good company culture and friendly coworkers

缺点

Compensation/pay not competitive

Limited career advancement opportunities

Poor management and lack of direction

薪资范围

5个数据点

Senior/L5

Intern

Senior/L5 · MACHINE LEARNING ENGINEER

1份报告

$318,500

年薪总额

基本工资

$245,000

股票

奖金

$318,500

最新动态

Local model for video annotation on Mac Mini

I am looking for a local model to annotate terabytes of video on a Mac Mini. It should ideally provide timestamped descriptions of the scene, similar to Twelve Labs or Descript. The annotation can be done slowly. I will leave the Mac running 24/7 for weeks/months to get this done. Any thoughts on models to use?

1w ago

Top 5 Products from Yesterday on Product Hunt

Hey everyone 👋 Check out these 5 awesome products that stood out yesterday on Product Hunt! A nice mix of AI, productivity, and innovation: **1. Dune** Context-aware Mac keypad that automates workflows + meetings. Awesome for streamlining tasks. **2. Claude Desktop Buddy** Bringing Claude into the physical world with maker hardware. A unique blend of AI and hardware for personal productivity. **3. The New Waydev** Measure the full AI SDLC. From token to production – perfect for developer

2w ago

Full-Feature ElevenLabs MCP - TwelveLabs!

**NOTE: This is NOT related to the TwelveLabs video platform**. This is purely for ElevenLabs Conversational AI. -- I thought I was clever coming up with the name until I saw that 🫠 The official ElevenLabs MCP connector is great for TTS/voice cloning, but it doesn't cover the Conversational AI API at all — no agent config access, no conversation transcripts, no knowledge base management, etc. **Official ElevenLabs MCP:** * Create agent ✅ * Get agent details ✅ * List agents ✅ * Get conve

4w ago

Machine Learning Engineer, 6+ Years Experience at Twelve Labs San Francisco, CA

4w ago