招聘

Principal Research Engineer, Gemini Evals

Google DeepMind

Mountain View, California, US

On-site

Full-time

5d ago

Snapshot

This role is for a Principal level Research Engineer to lead the strategic development and execution of robust data pipelines, evaluation frameworks, and metric systems for the Gemini family of models and their associated product applications. As a key technical leader and individual contributor, you will apply deep expertise in large-scale machine learning, statistical rigor, and scalable engineering to ensure the safety, performance, and ethical alignment of our frontier AI systems before and after deployment.

About us

Artificial Intelligence could be one of humanity's most useful inventions. At Google Deep Mind, we’re a team of scientists, engineers, machine learning experts, and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

This role is part of the Gemini Evaluation research teams. The Gemini Evals team defines success for Gemini, establishes metrics to track progress, and provides clear, actionable insights to guide development. As a Research Engineer on this team, you will be at the forefront of building the data and evaluation systems that ensure the safety and quality of the Gemini family of models.

The Role

As a Principle Research Engineer, you will operate as a technical expert and leader within the Gemini Data and Evaluation team. Your primary focus will be to architect and execute the rigorous evaluation and data systems that underpin all major model release and product launch decisions for Gemini.

This is a highly cross-functional role requiring a blend of deep ML research, world-class software engineering, and strategic influence. You will define the data strategy for critical evaluation campaigns, design novel metrics to measure safety and performance at scale, and mentor a team of engineers and researchers to build high-quality, reproducible systems. You will be accountable for communicating complex evaluation results directly to leadership stakeholders to guide the responsible deployment of our most advanced AI technology.

Key responsibilities

Technical Leadership & Strategy:

Work on post-training evaluation and fine-tuning of large-scale models to improve performance and safety.
Define and champion the technical roadmap for large-scale data and evaluation supporting the Gemini model family and its real-world applications
Drive the research of novel, high-signal evaluation methods (automated, human-in-the-loop, and adversarial) to measure model capabilities, alignment, safety, and trustworthiness.
Actively contribute to the broader scientific community by presenting findings on cutting-edge AI evaluation and safety methods.

About You

In order to set you up for success as a at Google Deep Mind, we look for the following skills and experience:

10+ years of experience in researching engineering, with at least 5 years in a technical leadership role.
Experience with large-scale machine learning systems, data processing pipelines and evaluation methodologies.
Experience with large language models (LLMs) and their evaluation.
Experience in post-training evaluation research

Total Views

Apply Clicks

Mock Applicants

Scraps

Similar Jobs

Senior Applied Scientist and Principal Applied Scientist (Multiple openings)

Microsoft · United States, Washington, Redmond

Responsable des techniciens de centre de données / Data Center Technician Manager

Microsoft · Canada, Québec, Quebec City

Senior Data Center Capacity Delivery Manager

Anthropic · San Francisco, CA

Principal/Senior Applied Scientist Security Models Training Team - Next-Gen frontier research

Microsoft · Israel, Tel Aviv, Herzliya; Israel, Multiple Locations, Multiple Locations

Senior Data Scientist, Skill & Matchmaking - VALORANT Data Ops

Riot Games · Mountain View, CA

About Google DeepMind

Google DeepMind

Acquired

DeepMind Technologies Limited, trading as Google DeepMind or simply DeepMind, is a British-American artificial intelligence research laboratory which serves as a subsidiary of Alphabet Inc.

1,001-5,000

Employees

London

Headquarters

Reviews

3.8

10 reviews

Work Life Balance

3.8

Compensation

4.2

Culture

3.5

Career

4.0

Management

2.8

68%

Recommend to a Friend

Pros

Smart and brilliant colleagues

Good compensation and benefits

Work flexibility and remote options

Cons

Poor management and leadership issues

Bureaucracy and slow processes

Constantly changing priorities and goals

Interview Experience

5 interviews

Difficulty

3.0

/ 5

Duration

21-35 weeks

Offer Rate

60%

Experience

Positive 60%

Neutral 40%

Negative 0%

Interview Process

Application Review

Phone Screen/Online Assessment

Technical Interview

Team Matching Interview

Offer

Common Questions

Coding/Algorithm

Technical Knowledge

Behavioral/STAR

Research Experience

System Design

News & Buzz

Google Deepmind pioneer David Silver departs to found AI startup, betting LLMs alone won't reach superintelligence - the-decoder.com

Source: the-decoder.com

News

5w ago

Apple loses more AI researchers, Siri exec to Google and Meta - 9to5Mac

Source: 9to5Mac

News

5w ago

Apple Loses More AI Researchers and a Siri Executive in Latest Departures - Bloomberg

Source: Bloomberg

News

5w ago

Google DeepMind seeks team lead for growing AI chip design effort - Data Center Dynamics

Source: Data Center Dynamics

News

5w ago