採用

Sr. Applied Scientist, Ads AI Core Infrastructure

Amazon

New York, NY, USA

On-site

Full-time

3w ago

Amazon Advertising is one of Amazon's fastest growing and most profitable businesses, responsible for defining and delivering AI-powered solutions that transform how advertisers make strategic decisions. We deliver billions of ad impressions and process massive volumes of advertiser data every single day. You'll work with us to pioneer breakthrough approaches in how AI agents access and reason over real-time advertiser data at scale.

We are using generative AI and agentic systems to help advertising agents provide instant, strategic advice to millions of advertisers. You will need to invent new techniques for agent orchestration, context optimization, and code generation to ensure we're delivering accurate, trustworthy insights with minimal latency and token consumption. You'll create feedback loops to ensure our solutions are constantly evaluating themselves and improving.

The Ads Real-Time Data Service team is seeking an exceptional Applied Scientist to research and develop novel approaches for agent-data interaction. The Ads Real-Time Data Service team is solving one of the most critical challenges in advertising AI: instant access to advertiser context. We're building the infrastructure that provides immediate, pre-computed access to advertiser data via Model Context Protocol (MCP) servers—an emerging standard for AI agent-data interaction. We're building summarized data for context using a mix of state of the art techniques like Code Act and RAG-based embeddings, achieving a fundamental transformation in how AI agents interact with data.

This role balances applied research (60%) with productionization (40%), giving you the opportunity to both advance the state of the art and see your innovations deployed at Amazon scale.

Key job responsibilities

Agent Orchestration & Optimization Research
Research and develop novel algorithms for agent-data interaction patterns that minimize latency, token consumption, and error rates
Investigate multi-agent orchestration strategies for complex advertiser queries requiring data from multiple sources
Develop techniques for automatic query optimization and caching strategies based on agent behavior patterns
Large Language Model Context & Token Optimization
Invent new methods for compressing advertiser context representations while preserving semantic meaning and analytical utility
Research optimal metadata generation techniques that help large language models understand and reason over structured advertiser data
Design evaluations to measure the impact of different data representations on agent response quality and token efficiency
Develop adaptive context selection algorithms that dynamically choose relevant data based on query intent
RAG-Based Embeddings & Semantic Search
Pioneer new RAG-based embedding approaches optimized for real-time advertiser data delivery with sub-second latency
Research and implement semantic search and retrieval techniques for advertiser datasets using vector embeddings
Design advertiser context frameworks that enable automatic schema mapping from advertiser concepts to data representations
Develop evaluation frameworks to measure performance across dimensions of latency, accuracy, and developer experience
Experimentation & Productionization
Design and execute rigorous experiments comparing traditional API orchestration versus Code Act patterns and RAG-based approaches across metrics like success rate, latency, token consumption, and response quality
Analyze large-scale advertiser interaction data to identify patterns, bottlenecks, and optimization opportunities
Collaborate with engineering teams to productionize research innovations and deploy them to 30+ advertising agents and skills
Establish evaluation metrics and benchmarks for agent-data interaction performance
Cross-Functional Collaboration & Thought Leadership
Partner with agent builder teams to understand their data requirements and constraints
Work with platform engineers to implement and optimize MCP servers, data pipelines, and sandbox execution environments
Collaborate with product managers to translate research insights into product features and roadmap priorities
Stay current on latest advancements in agentic AI research, specifically in large language models, multi-agent systems, chain of thought reasoning, and autonomous agents
Research Publication & Innovation
Author technical papers for top-tier conferences on agent orchestration, context optimization, RAG-based embeddings, and real-time data integration
File patents for novel techniques in agent-data interaction, token optimization, and Code Act patterns
Present research findings at internal tech talks and external conferences
Mentor engineers and junior scientists on machine learning techniques, experimental design, and research methodologies

A day in the life
You start your morning analyzing experiment results from overnight runs comparing three evaluations for different RAG-based embedding approaches. The data shows that one of the embedding pattern is returning a significant improvement in accuracy. You create a spec file with the findings and start drafting a technical paper to be shared with Amazon AI forum.

Mid-morning, you're in a design session with the engineering team discussing how to optimize RAG-based embeddings for semantic search over advertiser data. You propose using a hybrid approach combining dense and sparse embeddings to represent campaign metadata, enabling agents to find relevant campaigns through natural language queries while maintaining sub-second latency. You sketch out the architecture and discuss trade-offs between embedding model size, search latency, and accuracy.

After lunch, you dive into advertiser interaction logs from advertising agents and skills. You're looking for patterns in how advertisers ask questions about their campaigns. You discover that 60% of queries follow a similar structure: filter campaigns by criteria, aggregate metrics, and compare to benchmarks. This insight leads you to design a new pre-computation strategy using RAG-based embeddings that could reduce query latency by 40%.

In the afternoon, you collaborate with an Applied Scientist from an advertising agent team. They're seeing inconsistent results when agents try to calculate complex metrics across multiple campaigns. You investigate and discover the issue is related to how the agent interprets the advertiser context. You propose enriching the RAG-based embeddings with richer metadata descriptions and run experiments showing this improves calculation accuracy from 85% to 98%.

Late afternoon, you're prototyping a new approach for adaptive context selection using RAG-based embeddings with the spec file you generated earlier. Instead of providing agents with all available advertiser data, you want to dynamically select the most relevant datasets based on query intent using semantic similarity. You build a quick proof-of-concept and test it on historical queries. The results are promising: 30% reduction in tokens with no loss in response quality.

About the team
The Ads Real-Time Data Service team is a diverse group of passionate engineers and scientists dedicated to advancing agent-data interaction technology for advertising AI. We value creativity, collaboration, and a commitment to excellence. Our team thrives on tackling complex problems at the intersection of real-time data engineering, AI agent systems, and large language model optimization—turning innovative research ideas into production systems that serve millions of advertisers.

We are highly motivated, collaborative and fun-loving with an entrepreneurial spirit and bias for action. We have a broad mandate to experiment and innovate, working on problems in agentic AI, context optimization, RAG-based embeddings, and real-time data delivery. We celebrate both research excellence (papers, patents) and engineering impact (production systems serving 30+ advertising agents and skills). We maintain a sustainable pace with flexible work arrangements and a strong focus on work-life balance.

Basic Qualifications

3+ years of building machine learning models for business application experience
PhD, or Master's degree and 6+ years of applied research experience
Experience programming in Java, C++, Python or related language
Experience with neural deep learning methods and machine learning

Preferred Qualifications

Experience with modeling tools such as R, scikit-learn, Spark MLLib, Mx Net, Tensorflow, numpy, scipy etc.
Experience with large scale distributed systems such as Hadoop, Spark etc.

Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.

Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company’s reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.

USA, CA, Palo Alto - 192,200.00 - 260,000.00 USD annually
USA, NY, New York - 183,800.00 - 248,700.00 USD annually
USA, WA, Seattle - 167,100.00 - 226,100.00 USD annually

総閲覧数

応募クリック数

模擬応募者数

スクラップ

類似の求人

Senior Research Scientist - Personalization

Spotify · New York, NY

Principal Machine Learning Engineer, Content Engineering

Paramount · New York, NY, US, 10036

Staff GenAI Engineer - Application Performance Monitoring (APM)

Datadog · New York, New York, USA

Senior Applied AI Engineer (Value Engineering) - Life Sciences

Celonis · New York, US, New York

Applied AI ML Engineer-Sr. Associate

JPMorgan Chase · New York, NY, United States, US

Amazonについて

Amazon

Public

Amazon.com, Inc. is an American multinational technology company engaged in e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence.

10,001+

従業員数

Seattle

本社所在地

$1.5T

企業価値

レビュー

2.9

10件のレビュー

ワークライフバランス

2.8

報酬

3.7

企業文化

2.5

キャリア

2.3

経営陣

2.1

35%

友人に勧める

良い点

Good pay and compensation

Strong benefits package

Flexible scheduling options

改善点

Poor management and leadership

Limited growth and promotion opportunities

High stress and demanding work environment

給与レンジ

4件のデータ

Junior/L3

Mid/L4

Principal/L7

Senior/L5

Staff/L6

Director

Junior/L3 · Data Scientist L4

0件のレポート

$181,968

年収総額

基本給

ストック

ボーナス

$154,672

$209,264

面接体験

10件の面接

難易度

3.7

/ 5

期間

21-35週間

内定率

20%

体験

ポジティブ 10%

普通 10%

ネガティブ 80%

面接プロセス

Application Review

Recruiter Screen

Online Assessment

Technical Phone Screen

Onsite/Virtual Loop

Team Matching

Offer

よくある質問

Coding/Algorithm

System Design

Behavioral/STAR

Leadership Principles

Technical Knowledge

ニュース＆話題

Amazon vs. Walmart: This Isn't Even Close - The Motley Fool

The Motley Fool

News

2d ago

'Kevin' Review: Jason Schwartzman, Aubrey Plaza in Amazon Cat Cartoon - The Hollywood Reporter

The Hollywood Reporter

News

3d ago

Amazon's best weekend deals: Apple, Clinique, Yeti and more — save up to 70% - Yahoo

Yahoo

News

3d ago

Amazon Delivery Drones Involve a Perilous 10-Foot Drop. Users Are Posting the Apparent Results - Gizmodo

Gizmodo

News

3d ago