
Principal Software Engineer
United States, Washington, Redmond; United States, California, Mountain View · On-site · Full-time · 3mo ago
Compensation
$139,900 - $274,800
Required skills
C
C++
C#
Java
JavaScript
Python
GPU inference optimization
Overview
Monetization Engineering is responsible for building a unified, intelligent, and resilient monetization platform that drives revenue across Microsoft’s AI-native surfaces, including Copilot, Search, MSN, Shopping, and both first-party and third-party ecosystems. Our mission is to enhance advertiser value, optimize platform performance, and achieve long-term revenue growth through large-scale systems, machine learning-driven optimization, experimentation, and cross-surface innovation.
The Ads Brain team serves as the technological core of Microsoft's rapidly expanding digital advertising business. The team focuses on accelerating Microsoft’s large-scale deep learning inference for Ads, Shopping, Copilot, and other surfaces, including both offline and online applications that support OpenAI LLM models and next-generation LLMs/SLMs. We play a pivotal role in bridging state-of-the-art GPU and deep learning technologies with critical business applications.
We are seeking an experienced professional with expertise in GPU inference optimization and a deep understanding of LLM/SLM architecture to join our team. This is a unique opportunity to contribute to cutting-edge advancements in AI and deep learning while driving impactful solutions for Microsoft’s advertising and monetization platforms.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Starting January 26, 2026, Microsoft AI (MAI) employees who live within a 50-mile commute of a designated Microsoft office in the U.S. or a 25-mile commute of a non-U.S., country-specific location are expected to work from the office at least four days per week. This expectation is subject to local law and may vary by jurisdiction.
Responsibilities
- Engage directly with key partners to understand, design, and implement complex inferencing capabilities for state-of-the-art deep learning models, driving innovations in AI infrastructure.
- Work with cutting-edge hardware and software stacks to deliver best-in-class inference performance while optimizing for cost, leveraging open-source projects to advance deep learning applications.
- Collaborate with external and internal teams to identify new areas for improvement and contribute to innovations that enhance model performance and deployment. Discover/solve impactful technical problems, advance state-of-the-art technologies, and translate ideas into production.
- Develop internal tools to support the AI lifecycle, including experiment tracking, model versioning, and performance monitoring.
- Create deep connections within our communities, focus on increasing representation, retaining, and growing our current team members, while fostering awareness and growth through an inclusive environment.
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Preferred Qualifications:
- Experience with model compression (quantization, distillation, SVD, low‑rank methods).
- Experience in building high‑throughput inference serving stacks (continuous batching, KV‑cache optimizations, routing).
- Familiarity with Microsoft’s DLIS, Talon routing, Triton/TensorRT‑LLM stack, and Azure/H100/A100 GPU environments.
- Publications, competition wins, or real‑world deployments related to model efficiency.
- Solid experience in GPU inference optimization (CUDA, TensorRT, Triton, or custom GPU kernels).
- Proficiency in profiling tools (Nsight, TensorBoard, PyTorch Profiler) and ability to identify CPU/GPU bottlenecks.
- Deep understanding of LLM/SLM architectures (attention, embeddings, MoE, decoders).
- Experience optimizing latency‑critical online services.
- Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
Other Requirements:
Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
#MicrosoftAI
Software Engineering IC5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
About Microsoft

Microsoft
Public
Microsoft Corporation is an American multinational technology conglomerate headquartered in Redmond, Washington.
Employees: 10,001+
Headquarters: Redmond
Company valuation: $3000B
Reviews
4.4 (10 reviews)
Work-life balance: 3.2
Compensation: 4.1
Company culture: 4.3
Career: 3.8
Management: 4.0
Recommend to a friend: 82%
Pros
- Cutting-edge technology and innovative projects
- Great team culture and collaborative atmosphere
- Excellent benefits and competitive compensation
Cons
- Heavy workload and frequent overtime
- High expectations and stressful environment
- Bureaucratic processes can be slow
Salary range
5,620 data points
Senior/L5 · Account Management (5 reports)
Total annual compensation: $209,483
Base salary: $181,941
Stock: -
Bonus: -
$194,895
$209,483
Interview experience
1 interview
Difficulty: 4.0 / 5
Duration: 14-28 weeks
Experience: Positive 0%, Neutral 0%, Negative 100%
Interview process
1. Application Review
2. Recruiter Screen
3. Technical Phone Screen
4. Onsite/Virtual Interviews
5. Team Matching
6. Offer
Common questions
- Coding/Algorithm
- System Design
- Behavioral/STAR
- Technical Knowledge