채용

Research Scientist, Generative Worlds

Google DeepMind

New York City, New York, US; San Francisco, California, US

On-site

Full-time

5d ago

Snapshot

Help us build generative models of the 3D world. World models power numerous domains, such as media generation, visual reasoning, simulation, planning for embodied agents, and real-time interactive experiences. Work with us to build better versions of Gemini, Genie, and Veo, while also exploring new, spatial modalities beyond images and videos.

The Role Key responsibilities: Conduct research to build generative multimodal models of the 3D world. Solve essential problems to train world models at massive scale: build and train large-scale systems for data annotation, curate and annotate training datasets, build and maintain large model training infrastructure, develop scaling ladders and training recipes, develop metrics for spatial intelligence, enable real-time interactive experiences, study the integration of spatial modalities with multimodal language models, and of course: actually train massive-scale models.

Areas of focus:

3D computer vision, spatial annotation systems
Spatial representations
Training large-scale transformers
Generative pixel and latent models
Infrastructure for large-scale data pipelines and annotation.
Quantitative evals for spatial accuracy and intelligence.
Model scaling, efficiency, distillation, training infrastructure

About you

We seek individuals who are passionate about large-scale generative models and believe spatial understanding and generation are on the path to intelligence. We strive for simple methods that scale and look for candidates excited to improve models through infrastructure, data, evals, and compute.

In order to set you up for success as a Research Scientist/Engineer at Google Deep Mind, we look for the following skills and experience:

MSc or PhD in computer science or machine learning, or equivalent industry experience.
Experience with large-scale transformer models and/or large-scale data pipelines.
Track record of releases, publications, and/or open source projects relating to video generation, world models, multimodal language models, or transformer architectures.
Exceptional engineering skills in Python and deep learning frameworks (e.g., Jax, Tensor Flow, Py Torch), with a track record of building high-quality research prototypes and systems.
Demonstrated experience in large-scale training of multimodal generative models.

In addition, the following would be an advantage:

Experience building training codebases for large-scale video or multimodal transformers.
Expertise optimizing efficiency of distributed training systems and/or inference systems.
Strong background in 3D representations or 3D computer vision
Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV).
A keen eye for visual aesthetics and detail, coupled with a passion for creating high-quality, visually compelling generative content.

Total Views

Apply Clicks

Mock Applicants

Scraps

Similar Jobs

Senior Data Scientist

Vox Media · New York City

Staff Product Manager, Database

Pinecone · New York City

Data Scientist

FanDuel · New York City

Data Science Manager

Asana · New York City

Staff Data Scientist, Marketing

Asana · New York City

About Google DeepMind

Google DeepMind

Acquired

DeepMind Technologies Limited, trading as Google DeepMind or simply DeepMind, is a British-American artificial intelligence research laboratory which serves as a subsidiary of Alphabet Inc.

1,001-5,000

Employees

London

Headquarters

Reviews

3.8

10 reviews

Work Life Balance

3.8

Compensation

4.2

Culture

3.5

Career

4.0

Management

2.8

68%

Recommend to a Friend

Pros

Smart and brilliant colleagues

Good compensation and benefits

Work flexibility and remote options

Cons

Poor management and leadership issues

Bureaucracy and slow processes

Constantly changing priorities and goals

Interview Experience

5 interviews

Difficulty

3.0

/ 5

Duration

21-35 weeks

Offer Rate

60%

Experience

Positive 60%

Neutral 40%

Negative 0%

Interview Process

Application Review

Phone Screen/Online Assessment

Technical Interview

Team Matching Interview

Offer

Common Questions

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

Research Experience

System Design

News & Buzz

Google Deepmind pioneer David Silver departs to found AI startup, betting LLMs alone won't reach superintelligence - the-decoder.com

Source: the-decoder.com

News

5w ago

Apple loses more AI researchers, Siri exec to Google and Meta - 9to5Mac

Source: 9to5Mac

News

5w ago

Apple Loses More AI Researchers and a Siri Executive in Latest Departures - Bloomberg

Source: Bloomberg

News

5w ago

Google DeepMind seeks team lead for growing AI chip design effort - Data Center Dynamics

Source: Data Center Dynamics

News

5w ago