招聘
Principal/Senior Applied Scientist Security Models Training Team - Next-Gen frontier research

Principal/Senior Applied Scientist Security Models Training Team - Next-Gen frontier research
Israel, Tel Aviv, Herzliya; Israel, Multiple Locations, Multiple Locations
·
On-site
·
Full-time
·
1mo ago
Required Skills
Machine Learning
Large Language Models
Natural Language Processing
Deep Learning
Python
Reinforcement Learning
Model Architecture Design
Overview
The Security Models Training team is expanding to drive the development of a new type of GenAI architecture that can effectively address the unique challenges of cybersecurity. This is a unique opportunity to engage in frontier research that is product-focused at the same time.
The Security Models Training team builds and operates the large‑scale AI training and adaptation engines that power Microsoft Security products, turning cutting‑edge research into reliable, production‑ready capabilities.
As a Principal/Senior Applied Scientist, you will own end‑to‑end model development for security scenarios, including developing new model architectures, continual pre‑training, task‑focused fine‑tuning, reinforcement learning, and objective, benchmark‑driven evaluation.
You will drive training efficiency and reliability on distributed GPU systems, deepen model reasoning and tool‑use capabilities, and embed Responsible AI, privacy, and compliance into every stage of the workflow. The role is hands‑on and impact‑focused, partnering closely with engineering and product to translate innovations into shipped, measurable outcomes, defining quality gates and readiness criteria, and mentoring scientists and engineers to scale results across globally distributed teams.
You will combine strong coding, experimentation, and debugging skills with a systems mindset to accelerate iteration cycles, improve throughput and cost‑effectiveness, and help shape the next generation of secure, trustworthy AI for our customers.
Responsibilities:
You’ll work as part of an Applied Science team on high-impact, technically ambitious AI projects that directly shape the future of AI in Cyber security, with ownership for taking advanced research through to production impact.
-
Technical Leadership & Ownership: set technical direction for major security domain initiatives; lead security model programs spanning pre‑training, task tuning, reinforcement learning, and evaluation; translate cutting‑edge research into production‑ready capabilities.
-
Advanced Model Design– Building and customizing deep learning model architectures (e.g., modifying transformer blocks, attention/memory modules, etc.) at the SLM/LLM scale; making principled architectural tradeoffs to improve reliability, robustness, and security‑specific behavior.
-
Advanced Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and other modalities, including time-series.
-
Design & Evaluate Datasets – Build high-quality datasets and benchmarks; define objective evaluation frameworks and quality gates; run ablation studies to measure impact and optimize data and training effectiveness to support confident product decisions.
-
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets, with attention to privacy, governance, and long‑term reuse across security scenarios.
-
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact.
-
Qualifications- M.Sc. / Ph.D. in Computer Science, Information Systems, Electrical or Computer Engineering or Data Science (Ph.D. strongly preferred). Candidates with M.Sc. / Ph.D. in related fields with proven industry experience or a strong publication record in the areas of LLM, Information Retrieval, Machine Learning, Natural Language Processing, Time Series Forecasting and Deep Learning are considered as well.
-
Proven hands-on experience of at least 5 years (including post-grad work) in building and deploying Machine Learning products. Key areas of expertise include Natural Language Processing and Large Language Models, along with an understanding of concepts such as Privacy and Responsible AI. Candidates are expected to demonstrate a strong history of successfully translating applied research into production-ready solutions, along with a proven track record of delivering projects within large-scale production environments.
-
Proven expertise in the LLM and/or time-series forecasting domain, demonstrating comprehensive knowledge of relevant concepts in the domain. Ideal applicants should be proficient in areas such as LLM’s pre and post training, including CPT, SFT and RL, LLM benchmarking, agentic flows, and model alignment.
-
Hands-on experience in building neural model architectures at the 100M+ scale and the proficiency to adapt them at all abstraction levels down the individual block (e.g. changing the innerworkings of an attention block, introducing new blocks, or changing the routings)
-
Demonstrated proficiency in problem-solving and data analysis, with substantial expertise in evaluating the performance of large language models (LLMs) and/or time-series forecasting models, developing benchmarks tailored to practical scenarios.
Preferred Qualifications -:
- PhD degree in Computer Science, Information Systems, or Data Science.
- Evidence of research contributions through publications or records of top-tier journal and conference publications or submitted/accepted papers in top venues (KDD, ICML, AAAI, ACL, ICLR, etc.)
- Proven track record in pre/post-training of large transformer models for language and/or time series tasks.
- Customer obsession and passion about making real world product impact through production deployed systems.
- Excellent verbal and written communication skills, with the ability to simplify and explain complex ideas.
- Effective collaboration skills while working within a globally distributed organization.
#MSFTSecurity #MSECAI
#MSECAIILDC
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Network Cluster Architect - Data center Infrastructure
Apple · Austin, TX

Senior Data Scientist Multiple Locations
Intuit · san diego

Senior Data Science Analyst_Pune
Citigroup · pune

Data Scientist II, ADSP Product Operations & Analytics
Amazon · New York, NY, USA

Applied Scientist, Amazon Ads, Demand Forecasting & Guidance
Amazon · Palo Alto, CA, USA
About Microsoft
Reviews
3.8
5 reviews
Work Life Balance
4.1
Compensation
4.3
Culture
3.4
Career
3.2
Management
3.0
65%
Recommend to a Friend
Pros
Excellent compensation and benefits package
Four-day workweek with improved work-life balance
Supportive managers and teams
Cons
High-pressure environment causing anxiety
Unprofessional interview processes
Limited creative work opportunities
Salary Ranges
5,571 data points
Mid/L4
Principal/L7
Senior/L5
Staff/L6
Director
Mid/L4 · Data and Applied Scientist
0 reports
$202,099
total / year
Base
$149,342
Stock
$32,252
Bonus
$20,505
$139,572
$301,212
Interview Experience
7 interviews
Difficulty
3.7
/ 5
Duration
14-28 weeks
Offer Rate
14%
Experience
Positive 14%
Neutral 29%
Negative 57%
Interview Process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Technical Interview
5
Onsite/Virtual Interviews
6
Final Round
7
Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
News & Buzz
Microsoft loses $400 billion in few hours, what's behind one of the worst stock market days for the compa - Times of India
Source: Times of India
News
·
5w ago
Microsoft Stock Tumbles 12.1% In Worst Day For Company In Years - HuffPost
Source: HuffPost
News
·
5w ago
Microsoft: The 'question' the company needs to answer - Yahoo Finance
Source: Yahoo Finance
News
·
5w ago
AI is a planet-sized bubble — and Microsoft's slump is a taste of the crash to come, tech guru Erik Gordon says - Business Insider
Source: Business Insider
News
·
5w ago