Jobs
Compensation
$100,600 - $199,000
Required skills
C
C++
C#
Java
JavaScript
Python
Overview:
Microsoft Azure High Performance Computing & AI Engineering (HPC & AI Eng) team is responsible for managing the core platform & fleet of AI & High Performance Computing products that customers use to run their most performant and demanding workloads. The AI Customer Experience (AICE) engineering team within the HPC & AI Eng. team is on the frontlines managing the flagship supercomputers and infrastructure used by top tier AI customers that enable breakthroughs such as ChatGPT and are highlighted in Top500, MLPerf and Graph500 rankings.
We run lean, obsess about customer experience and use evidence-based approach to decision making. We have live-site first, metrics-driven culture that prevents us from accumulating debt and necessity to put out fires on daily basis. You will be in a position that carries a ton of responsibility and provides opportunities to directly impact customers satisfaction.
As a Supercomputing Software Engineer on the AICE team, you will design & develop capabilities needed to monitor & efficiently operate across the infrastructure & fleet of supercomputers at scale. To enable first to know of critical incidents impacting customer capacity, you will create end to end data pipelines that process & synthesize large volume of telemetry, log files and other data sources to create actionable alerts.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
- Responsibilities- Contribute to improving key metrics such as Job Mean Time to Interrupt, Nodes in Service, Mean Time to Resolve on flagship supercomputers.
- Manages operations of supercomputers by responding quickly to mitigate issues.
- Implements systemic solutions and mitigations to more complex issues impacting performance or functionality of supercomputers
- Reviews and writes incident postmortem and presents insights that drive changes to reduce or eliminate incidents.
- Independently improves troubleshooting guides (TSGs), wikis, tests, and telemetry, adding comprehensive observability and monitoring capabilities.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of supercomputers while also driving consistency in monitoring and operations at scale.
Qualifications:
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or PythonOR equivalent experience.
Other Requirements:
- Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Bachelor's Degree in Computer ScienceOR related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, OR Python
- OR Master's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
Software Engineering IC3 - The typical base pay range for this role across the U.S. is USD $100,600 - $199,000 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $131,400 - $215,400 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here:
https://careers.microsoft.com/us/en/us-corporate-pay
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Total Views
0
Apply Clicks
0
Weekly mock applicants
0
Bookmarks
0
Similar jobs

Software Engineering, Customer Success
Attentive · United States

Software Engineer & Computer Science - Recent Grad/Full Time
Honeywell · United States, US

Software Engineer & Computer Science - Recent Grad/Full Time (US Person Required)
Honeywell · United States, US

We at JPMorganChase are excited to host an In-Person Networking event in New York, NY for Consumer & Community Banking in Software Engineering
JPMorgan Chase · United States, US

EDA Software Developer
Nokia · United States, US
About Microsoft

Microsoft
PublicMicrosoft Corporation is an American multinational technology conglomerate headquartered in Redmond, Washington.
10,001+
Employees
Redmond
Headquarters
$3000B
Valuation
Reviews
3.8
5 reviews
Work-life balance
4.1
Compensation
4.3
Culture
3.4
Career
3.2
Management
3.0
65%
Recommend to a friend
Pros
Excellent compensation and benefits package
Four-day workweek with improved work-life balance
Supportive managers and teams
Cons
High-pressure environment causing anxiety
Unprofessional interview processes
Limited creative work opportunities
Salary Ranges
5,620 data points
Senior/L5
Senior/L5 · Account Management
5 reports
$209,483
total per year
Base
$181,941
Stock
-
Bonus
-
$194,895
$209,483
Interview experience
1 interviews
Difficulty
4.0
/ 5
Duration
14-28 weeks
Experience
Positive 0%
Neutral 0%
Negative 100%
Interview process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
Common questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Culture Fit
News & Buzz
Could Microsoft Win The War For Enterprise AI? - Josh Bersin
Josh Bersin
News
·
6d ago
‘Starting In April’—Microsoft Changes Windows Update After 15 Years - Forbes
Forbes
News
·
1w ago
Microsoft is reportedly giving you a ton of Start menu customization options - XDA
XDA
News
·
1w ago
Get Microsoft Office apps on your Mac for under $9 each - Mashable
Mashable
News
·
1w ago