招聘

Principal/Senior Applied Scientist Security Models Training Team - Next-Gen frontier research

Microsoft

Israel, Tel Aviv, Herzliya; Israel, Multiple Locations, Multiple Locations

On-site

Full-time

1mo ago

Required Skills

Machine Learning

Large Language Models

Natural Language Processing

Deep Learning

Python

Reinforcement Learning

Model Architecture Design

Overview

The Security Models Training team is expanding to drive the development of a new type of GenAI architecture that can effectively address the unique challenges of cybersecurity. This is a unique opportunity to engage in frontier research that is product-focused at the same time.

The Security Models Training team builds and operates the large‑scale AI training and adaptation engines that power Microsoft Security products, turning cutting‑edge research into reliable, production‑ready capabilities.

As a Principal/Senior Applied Scientist, you will own end‑to‑end model development for security scenarios, including developing new model architectures, continual pre‑training, task‑focused fine‑tuning, reinforcement learning, and objective, benchmark‑driven evaluation.

You will drive training efficiency and reliability on distributed GPU systems, deepen model reasoning and tool‑use capabilities, and embed Responsible AI, privacy, and compliance into every stage of the workflow. The role is hands‑on and impact‑focused, partnering closely with engineering and product to translate innovations into shipped, measurable outcomes, defining quality gates and readiness criteria, and mentoring scientists and engineers to scale results across globally distributed teams.

You will combine strong coding, experimentation, and debugging skills with a systems mindset to accelerate iteration cycles, improve throughput and cost‑effectiveness, and help shape the next generation of secure, trustworthy AI for our customers.

Responsibilities:

You’ll work as part of an Applied Science team on high-impact, technically ambitious AI projects that directly shape the future of AI in Cyber security, with ownership for taking advanced research through to production impact.

Technical Leadership & Ownership: set technical direction for major security domain initiatives; lead security model programs spanning pre‑training, task tuning, reinforcement learning, and evaluation; translate cutting‑edge research into production‑ready capabilities.
Advanced Model Design– Building and customizing deep learning model architectures (e.g., modifying transformer blocks, attention/memory modules, etc.) at the SLM/LLM scale; making principled architectural tradeoffs to improve reliability, robustness, and security‑specific behavior.
Advanced Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and other modalities, including time-series.
Design & Evaluate Datasets – Build high-quality datasets and benchmarks; define objective evaluation frameworks and quality gates; run ablation studies to measure impact and optimize data and training effectiveness to support confident product decisions.
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets, with attention to privacy, governance, and long‑term reuse across security scenarios.
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact.
Qualifications- M.Sc. / Ph.D. in Computer Science, Information Systems, Electrical or Computer Engineering or Data Science (Ph.D. strongly preferred). Candidates with M.Sc. / Ph.D. in related fields with proven industry experience or a strong publication record in the areas of LLM, Information Retrieval, Machine Learning, Natural Language Processing, Time Series Forecasting and Deep Learning are considered as well.
Proven hands-on experience of at least 5 years (including post-grad work) in building and deploying Machine Learning products. Key areas of expertise include Natural Language Processing and Large Language Models, along with an understanding of concepts such as Privacy and Responsible AI. Candidates are expected to demonstrate a strong history of successfully translating applied research into production-ready solutions, along with a proven track record of delivering projects within large-scale production environments.
Proven expertise in the LLM and/or time-series forecasting domain, demonstrating comprehensive knowledge of relevant concepts in the domain. Ideal applicants should be proficient in areas such as LLM’s pre and post training, including CPT, SFT and RL, LLM benchmarking, agentic flows, and model alignment.
Hands-on experience in building neural model architectures at the 100M+ scale and the proficiency to adapt them at all abstraction levels down the individual block (e.g. changing the innerworkings of an attention block, introducing new blocks, or changing the routings)
Demonstrated proficiency in problem-solving and data analysis, with substantial expertise in evaluating the performance of large language models (LLMs) and/or time-series forecasting models, developing benchmarks tailored to practical scenarios.

Preferred Qualifications -:

PhD degree in Computer Science, Information Systems, or Data Science.
Evidence of research contributions through publications or records of top-tier journal and conference publications or submitted/accepted papers in top venues (KDD, ICML, AAAI, ACL, ICLR, etc.)
Proven track record in pre/post-training of large transformer models for language and/or time series tasks.
Customer obsession and passion about making real world product impact through production deployed systems.
Excellent verbal and written communication skills, with the ability to simplify and explain complex ideas.
Effective collaboration skills while working within a globally distributed organization.

#MSFTSecurity #MSECAI
#MSECAIILDC

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Total Views

Apply Clicks

Mock Applicants

Scraps

Similar Jobs

Network Cluster Architect - Data center Infrastructure

Apple · Austin, TX

Senior Data Scientist Multiple Locations

Intuit · san diego

Senior Data Science Analyst_Pune

Citigroup · pune

Data Scientist II, ADSP Product Operations & Analytics

Amazon · New York, NY, USA

Applied Scientist, Amazon Ads, Demand Forecasting & Guidance

Amazon · Palo Alto, CA, USA

About Microsoft

Microsoft

A software corporation that develops, manufactures, licenses, supports, and sells a range of software products and services.

10,001+

Employees

Redmond

Headquarters

$3000B

Valuation

Reviews

3.8

5 reviews

Work Life Balance

4.1

Compensation

4.3

Culture

3.4

Career

3.2

Management

3.0

65%

Recommend to a Friend

Pros

Excellent compensation and benefits package

Four-day workweek with improved work-life balance

Supportive managers and teams

Cons

High-pressure environment causing anxiety

Unprofessional interview processes

Limited creative work opportunities

Salary Ranges

5,571 data points

Mid/L4

Principal/L7

Senior/L5

Staff/L6

Director

Mid/L4 · Data and Applied Scientist

0 reports

$202,099

total / year

Base

$149,342

Stock

$32,252

Bonus

$20,505

$139,572

$301,212

Interview Experience

7 interviews

Difficulty

3.7

/ 5

Duration

14-28 weeks

Offer Rate

14%

Experience

Positive 14%

Neutral 29%

Negative 57%

Interview Process

Application Review

Recruiter Screen

Technical Phone Screen

Technical Interview

Onsite/Virtual Interviews

Final Round

Offer

Common Questions

Coding/Algorithm

System Design

Behavioral/STAR

Technical Knowledge

Past Experience

News & Buzz

Microsoft loses $400 billion in few hours, what's behind one of the worst stock market days for the compa - Times of India

Source: Times of India

News

5w ago

Microsoft Stock Tumbles 12.1% In Worst Day For Company In Years - HuffPost

Source: HuffPost

News

5w ago

Microsoft: The 'question' the company needs to answer - Yahoo Finance

Source: Yahoo Finance

News

5w ago

AI is a planet-sized bubble — and Microsoft's slump is a taste of the crash to come, tech guru Erik Gordon says - Business Insider

Source: Business Insider

News

5w ago