Research Intern - AI Frameworks (Network Systems and Tools)
United States, Washington, Redmond · On-site · Full-time · 3mo ago
Research Internships at Microsoft provide a dynamic environment for research careers with a network of world-class research labs led by globally-recognized scientists and engineers, who pursue innovation in a range of scientific and technical disciplines to help solve complex challenges in diverse fields, including computing, healthcare, economics, and the environment.
Advances in Artificial Intelligence (AI) increasingly depend on breakthroughs in systems and architecture, where hardware, models, and software must be co-designed to scale efficiently. This Research Internship offers the opportunity to explore next-generation AI systems through performance modeling, architectural analysis, and emerging inference mechanisms. Research Interns will investigate topics such as disaggregated inference, memory architecture, and interconnect technologies, with a specific focus on request scheduling and key-value (KV) caching optimizations. This role is ideal for students passionate about understanding AI systems end-to-end and shaping the architectural foundations of tomorrow’s intelligent datacenters.
Responsibilities
Research Interns put inquiry and theory into practice. Alongside fellow doctoral candidates and some of the world’s best researchers, Research Interns learn, collaborate, and network for life. Research Interns not only advance their own careers, but they also contribute to exciting research and development strides. During the 12-week internship, Research Interns are paired with mentors and expected to collaborate with other Research Interns and researchers, present findings, and contribute to the vibrant life of the community. Research internships are available in all areas of research, and are offered year-round, though they typically begin in the summer.
Additional Responsibilities
- Investigate and evaluate emerging disaggregated KV cache architectures.
- Implement a hierarchical storage architecture with multiple tiers:
  - GPU Memory: Active working set of KV caches currently used by the model
  - CPU DRAM: Hot cache for recently used KV chunks using pinned memory for efficient GPU-CPU transfers
  - Local Storage: Large-scale local caching (NVMe, local disk)
- Build a peer-to-peer (P2P) KV cache sharing architecture that enables direct, high-performance cache transfer between multiple LLM serving instances without requiring centralized cache servers.
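As a rough illustration of the tiered storage idea in the list above, the following Python sketch models three tiers (GPU memory, CPU DRAM, local storage) as least-recently-used maps, where a chunk evicted from one tier is demoted to the next and a hit in a slower tier is promoted back to the fastest one. The class name `TieredKVCache`, the tier names, the capacities, and the LRU policy are all assumptions made for illustration; the posting does not specify an eviction policy or an API, and real KV chunks would be tensors rather than strings.

```python
from collections import OrderedDict


class TieredKVCache:
    """Toy multi-tier LRU cache (hypothetical); tiers are ordered fastest first."""

    def __init__(self, capacities):
        # capacities: list of (tier_name, max_chunks), fastest tier first.
        self.tiers = [(name, cap, OrderedDict()) for name, cap in capacities]

    def _insert(self, level, key, value):
        # Place a chunk in the given tier; on overflow, demote the
        # least-recently-used chunk to the next (slower) tier.
        if level >= len(self.tiers):
            return  # Evicted from the slowest tier: the chunk is dropped.
        _, cap, store = self.tiers[level]
        store[key] = value
        store.move_to_end(key)
        while len(store) > cap:
            old_key, old_value = store.popitem(last=False)
            self._insert(level + 1, old_key, old_value)

    def put(self, key, value):
        # Remove any stale copy, then insert into the fastest tier.
        for _, _, store in self.tiers:
            store.pop(key, None)
        self._insert(0, key, value)

    def get(self, key):
        # Search fastest to slowest; promote a hit to the fastest tier.
        for _, _, store in self.tiers:
            if key in store:
                value = store.pop(key)
                self._insert(0, key, value)
                return value
        return None

    def location(self, key):
        # Name of the tier currently holding the chunk, or None.
        for name, _, store in self.tiers:
            if key in store:
                return name
        return None


cache = TieredKVCache([("gpu", 2), ("cpu_dram", 2), ("local_storage", 4)])
for chunk_id in ("a", "b", "c"):
    cache.put(chunk_id, f"kv-chunk-{chunk_id}")
print(cache.location("a"))  # "a" was demoted when "c" arrived
cache.get("a")              # a hit in a slower tier promotes "a" again
print(cache.location("a"), cache.location("b"))
```

A production design would likely add asynchronous transfers, pinned host buffers, and size-based rather than count-based capacities; this sketch only shows the demotion and promotion flow between tiers.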
Qualifications
Required Qualifications
- Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field.
Other Requirements
- Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship.
- In addition to the qualifications below, you’ll need to submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples. After you submit your application, a request for letters may be sent to your list of references on your behalf. Note that reference letters cannot be requested until after you have submitted your application, and furthermore, that they might not be automatically requested for all candidates. You may wish to alert your letter writers in advance, so they will be ready to submit your letter.
Preferred Qualifications
- Research experience in areas such as computer architecture, AI/ML systems, performance modeling, distributed systems, or hardware–software co-design.
- Programming skills in Python and C/C++, with experience building prototypes, simulators, or performance analysis tools.
- Familiarity with modern AI workloads and/or deep learning frameworks (e.g., PyTorch).
- Demonstrated ability to define and pursue original research directions in AI systems or architecture.
- Ability to collaborate effectively with researchers across disciplines and work in cross-group, cross-cultural environments.
- Proficient communication and presentation skills for sharing complex technical insights.
- Ability to think creatively and approach system and architecture challenges with unconventional or innovative solutions.
- Experience with PyTorch, CUDA, Triton, or performance-simulation tools.
- Background in large-scale system design, AI inference bottleneck analysis, or modeling cost/performance tradeoffs. Understanding of accelerator, memory-system, or interconnect design principles.
The base pay range for this internship is USD $6,710 - $13,270 per month. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $8,760 - $14,360 per month.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-intern-pay
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.