採用
Figure is an AI robotics company developing autonomous general-purpose humanoid robots. Our goal is to build embodied AI systems that can perceive, reason, and act in the real world. Figure is headquartered in San Jose, CA, and this role requires 5 days/week in-office collaboration.
Our Helix team is responsible for developing the core AI systems that power humanoid autonomy. We are looking for a Helix AI Engineer, Video Pretraining to lead the development of large-scale video foundation models trained on diverse real-world and robot-collected data.
This role focuses on pretraining models that learn from raw video—capturing motion, interaction, and temporal structure—to enable downstream capabilities in perception, prediction, and embodied reasoning.
Responsibilities
- Design and train large-scale video foundation models on diverse datasets spanning internet-scale video and robot-collected data
- Develop pretraining strategies that capture temporal dynamics, motion, and object interaction from raw video sequences
- Build models that learn transferable representations for downstream tasks such as perception, tracking, prediction, and control
- Explore architectures for video understanding and generation, including transformer-based and diffusion-based approaches
- Implement efficient data pipelines and training strategies for high-throughput video ingestion and large-scale distributed training
- Optimize model performance across compute, memory, and training efficiency constraints
- Collaborate closely with generative modeling, agent, and robot learning teams to integrate pretrained models into the autonomy stack
- Design evaluation frameworks and benchmarks to measure temporal understanding, prediction quality, and generalization
Requirements
- Experience training large-scale models on video data or other high-dimensional sequential modalities
- Strong understanding of modern deep learning architectures for video, vision, or multimodal systems
- Experience with large-scale pretraining, including dataset curation, training dynamics, and scaling laws
- Proficiency in Python and deep learning frameworks such as Py Torch
- Experience working with distributed training systems and large GPU clusters
- Strong experimental rigor and ability to iterate quickly on model design and training strategies
- Solid software engineering skills and ability to build scalable, reliable systems
- Ability to operate independently and drive ambiguous, high-impact research directions
Bonus Qualifications
- Experience working on frontier video models or multimodal foundation models
- Background in video diffusion, autoregressive video modeling, or world models
- Experience at leading AI labs such as OpenAI, Google Deep Mind, Google, Byte Dance, Midjourney, or Adobe
- Experience with large-scale dataset construction and filtering for video pretraining
- Familiarity with robotics, embodied AI, or learning from egocentric / first-person video
- Publication record in machine learning, computer vision, or multimodal AI
The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.
総閲覧数
0
応募クリック数
0
模擬応募者数
0
スクラップ
0
類似の求人

Machine Learning Engineer, Recommendation - E-Commerce
TikTok · San Jose, CA

Machine Learning Engineering Technical Leader (hybrid) - 2009800
Cisco · San Jose, California, US

Machine Learning Engineer
Adobe · San Jose

Applied Researcher I (AI Foundations)
Capital One · 5 Locations

Machine Learning Engineer, Commerce Ads Ranking
TikTok · San Jose, CA
Figure AIについて

Figure AI
Series BFigure AI, Inc. is an American robotics company developing humanoid robots that operate via artificial intelligence. The company was founded in 2022 by Brett Adcock. As of late 2025, its estimated value was $39 billion.
201-500
従業員数
Sunnyvale
本社所在地
$2.6B
企業価値
ニュース&話題
Figure.AI new balance policy allows their 03 humanoid robot to keep its balance even if some low-body actuators are lost
​ Figure just unveiled "Vulcan," a new AI balance policy that allows the Figure 03 to lose up to 3 lower-body actuators and still stay upright. Instead of a "single point of failure" ending the shift, the robot simply limps itself to the repair bay.
·
3d ago
·
643
·
150
I quit my sales job to vibe code a SaaS tool 2 months ago. Now I landed an interview for a 6 figure AI coding job at a tech company.
2 months ago I quit my job in B2B sales to build a SaaS product full time. Zero engineering background. No CS degree. Never written a line of code before in my life. I used Claude as my entire engineering team. Chat for architecture decisions, Claude Code for actually building the thing. No Cursor, no other AI editors. Just me and Claude going back and forth until stuff worked. Built a full stack SaaS product from scratch. Electron app with Puppeteer automation on the desktop side, Next.js on
·
2w ago
·
3
·
2
Figure AI's Humanoid Walks into A Photoshoot By Itself!
·
2w ago
·
353
·
87
Figure AI Robot Makes History at White House Event - The Tech Buzz
The Tech Buzz
News
·
3w ago