トレンド企業

Twelve Labs
Twelve Labs

Video understanding AI platform

Research Scientist or Engineer, Embedding & Search

職種データサイエンス
経験ミドル級
勤務地Seoul, South Korea
勤務オンサイト
雇用正社員
掲載2ヶ月前
応募する

必須スキル

AWS

WHO WE ARE:

At Twelve Labs, we are pioneering the development of cutting-edge multimodal foundation models that have the ability to comprehend videos just like humans do. Our models have redefined the standards in video-language modeling, empowering us with more intuitive and far-reaching capabilities, and fundamentally transforming the way we interact with and analyze various forms of media.

With a $110+ million in Seed and Series A funding, our company is backed by top-tier venture capital firms such as NVIDIA’s NVentures, NEA, Radical Ventures, and Index Ventures, and prominent AI visionaries and founders such as Fei-Fei Li, Silvio Savarese, Alexandr Wang and more. Headquartered in San Francisco, with an influential APAC presence in Seoul, our global footprint underscores our commitment to driving worldwide innovation.

Our partnership with NVIDIA and AWS gives us access to the most advanced chips, including B300s, enabling us to push the boundaries of what's possible in video AI.

We are a global company that values the uniqueness of each person’s journey. It is the differences in our cultural, educational, and life experiences that allow us to constantly challenge the status quo. We are looking for individuals who are motivated by our mission and eager to make an impact as we push the bounds of technology to transform the world. Join us as we revolutionize video understanding and multimodal AI.

ABOUT THE TEAM:

The Embedding & Search team sits at the core of Twelve Labs' video understanding capabilities. We design unified embedding spaces that span video, audio, text, and other modalities, and build retrieval systems that accurately surface results matching user intent across massive video collections.

Our research spans a broad range of challenges: multimodal representation learning through contrastive and probabilistic approaches, temporal video understanding including hierarchical segmentation and boundary detection, neural ranking architectures for multi-stage retrieval, and user behavior modeling to understand how people actually search for and interact with video content. We care deeply about both the algorithmic innovations that push state-of-the-art and the human-centered insights that make our systems genuinely useful.

Our research team has access to the most advanced chips in the world, including NVIDIA B300s, accelerating our research-to-production cycle.

ABOUT THE ROLE:

This position spans two tracks—Research Scientist and Research Engineer—determined by your strengths and interests. These roles exist on a spectrum rather than as discrete categories; both contribute to research and implementation.

As a Research Scientist, you will define research problems, formulate hypotheses, and design experiments that push the boundaries of what's possible in video search. You'll analyze user behavior patterns to uncover insights about how people interact with video content, and translate those insights into system improvements. You'll explore optimal architectures through rigorous ablation studies, develop evaluation methodologies that capture what matters, and communicate findings clearly to shape our technical direction.

As a Research Engineer, you will translate research ideas into stable, reproducible systems. You'll design and optimize training pipelines for large-scale distributed environments, build experiment infrastructure that accelerates research cycles, and ensure our models perform reliably in production. Your work bridges the gap between promising research results and systems that serve thousands of customers worldwide.

YOU MIGHT BE A GREAT FIT IF YOU HAVE:

We're looking for candidates with research experience in areas that align with our mission: video-text retrieval and contrastive learning, temporal video understanding and segmentation, learning-to-rank and neural reranking, or user modeling and behavioral analysis. Your experience should be demonstrated through past projects, concrete contributions, and research outputs.

You should be capable of independently driving research from ideation to execution. Strong proficiency in Python and Py Torch is essential, as is the ability to communicate effectively with colleagues from diverse backgrounds. Experience with large-scale model training, distributed systems, or deploying ML systems in production is a significant plus.

We evaluate based on relevant technical skills and research experience rather than degrees alone, though this is typically supported by an MS/PhD or equivalent practical experience in a relevant field.

WHAT MAKES THIS ROLE UNIQUE:

The gap between research and production is remarkably short here. Models and systems you build will be used by thousands of companies worldwide within months. We work as a unified team toward the broader goal of video search, rather than solving isolated problems. Our research philosophy balances rigorous experimentation with real-world application—we aim to build multimodal systems that are powerful, trustworthy, and genuinely useful.

OTHERS

  • Work Location: Seoul Itaewon office + Pangyo satellite office

  • Additional Info: 전문연구요원 편입/전직 가능합니다.

Even if you don't check every box, we encourage you to apply. If you're a zero-to-one achiever, a ferocious learner, and a kind team player who motivates others, you'll find a home at Twelve Labs.

HIRING PROCESS

Application Review → Recruiter Interview (비대면/30분) → Hiring Manager Interview (비대면/30분) → Technical Interview Round 1 (대면/60분) → Technical Interview Round 2 (비대면/90분) → Final Round Interview (비대면/30분) → Reference Check → Offer

閲覧数

0

応募クリック

0

Mock Apply

0

スクラップ

0

Twelve Labsについて

Twelve Labs

Twelve Labs

Series A

Intel Capital Corporation started off as the investment arm of Intel Corporation in 1991 and in January 2025, it spun off as a standalone investment fund.

51-200

従業員数

San Francisco

本社所在地

レビュー

10件のレビュー

3.8

10件のレビュー

ワークライフバランス

4.2

報酬

2.8

企業文化

4.0

キャリア

3.2

経営陣

3.5

65%

知人への推奨率

良い点

Great work-life balance

Supportive team and environment

Good company culture and friendly coworkers

改善点

Compensation/pay not competitive

Limited career advancement opportunities

Poor management and lack of direction

給与レンジ

5件のデータ

Senior/L5

Intern

Senior/L5 · MACHINE LEARNING ENGINEER

1件のレポート

$318,500

年収総額

基本給

$245,000

ストック

-

ボーナス

-

$318,500

$318,500

最新情報

Local model for video annotation on Mac Mini

I am looking for a local model to annotate terabytes of video on a Mac Mini. It should ideally provide timestamped descriptions of the scene, similar to Twelve Labs or Descript. The annotation can be done slowly. I will leave the Mac running 24/7 for weeks/months to get this done. Any thoughts on models to use?

Reddit

·

1w ago

·

1

·

1

Top 5 Products from Yesterday on Product Hunt

Hey everyone 👋 Check out these 5 awesome products that stood out yesterday on Product Hunt! A nice mix of AI, productivity, and innovation: **1. Dune** Context-aware Mac keypad that automates workflows + meetings. Awesome for streamlining tasks. **2. Claude Desktop Buddy** Bringing Claude into the physical world with maker hardware. A unique blend of AI and hardware for personal productivity. **3. The New Waydev** Measure the full AI SDLC. From token to production – perfect for developer

Reddit

·

2w ago

·

13

·

5

Full-Feature ElevenLabs MCP - TwelveLabs!

**NOTE: This is NOT related to the TwelveLabs video platform**. This is purely for ElevenLabs Conversational AI. -- I thought I was clever coming up with the name until I saw that 🫠 The official ElevenLabs MCP connector is great for TTS/voice cloning, but it doesn't cover the Conversational AI API at all — no agent config access, no conversation transcripts, no knowledge base management, etc. **Official ElevenLabs MCP:** * Create agent ✅ * Get agent details ✅ * List agents ✅ * Get conve

Reddit

·

4w ago

·

5

Machine Learning Engineer, 6+ Years Experience at Twelve Labs San Francisco, CA

Reddit

·

4w ago

·

1