refresh

지금 많이 보는 기업

지금 많이 보는 기업

Microsoft
Microsoft

Empowering every person and organization on the planet to achieve more.

Member of Technical Staff, Software Engineer - MAI SuperIntelligence team

직무DevOps
경력Staff+
위치Zürich, Switzerland
근무오피스 출근
고용정규직
게시3개월 전
지원하기

필수 스킬

Docker

Go

Terraform

Azure

Overview Help build the infrastructure that powers training, evaluation, and data platforms for reliable deployment of world-class foundational AI models. We are on a mission to create state-of-the-art AI models and deploy them across Microsoft products at an unprecedented scale.You’ll collaborate across engineering and research to design, evolve, and operate core research infrastructure, so that product teams can train faster, evaluate more rigorously, and ship with confidence. You’ll work closely with the teams that transform pre-trained models into the consumer Copilot experience.Microsoft’s mission is to empower every person and every organization to achieve more, and we build on values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive.Microsoft Superintelligence Team This role is part of Microsoft AI's Superintelligence Team. The MAIST is a startup-like team inside Microsoft AI, created to push the boundaries of AI toward Humanist Superintelligence—ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society—advancing science, education, and global well-being. We’re also fortunate to partner with incredible product teams giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly-ambitious and low ego individual, you’ll fit right in—come and join us as we work on our next generation of models!

  • Responsibilities- Design and build core platform services for scalable training and evaluation, including cluster orchestration, job scheduling, data and compute pipelines, and artifact management.
  • Standardize containerized workflows by maintaining Docker images, CI/CD, and runtime configurations; advocate for best practices in security, reproducibility, and cost efficiency.
  • Implement end-to-end observability and operations through metrics, tracing, logging, dashboard development, monitoring, and automated alerts for model training and platform health (using Prometheus, Grafana, Open Telemetry).
  • Architect and operate services on Azure cloud platforms, managing infrastructure-as-code (Terraform/Helm), secrets, networking, and storage.
  • Enhance developer experience by creating tools, CLIs, and portals that simplify job submission, metrics analysis, and experiment management for generalist software engineering and research teams.
  • Enforce security and compliance policies for data access, container hardening, and supply-chain integrity, and partner with security and privacy teams to maintain robust practices in multi-tenant environments and secret management.
  • Collaborate cross-functionally with data, model, and product teams to align infrastructure roadmaps with training needs, evaluation protocols, and Copilot product goals.

Qualifications:

Required skills

  • Strong software engineering background building reliable, scalable production systems (Python preferred)
  • Hands‑on experience supporting large‑scale ML / LLM training, evaluation, or experimentation infrastructure
  • Operating GPU‑heavy workloads in cloud environments using Docker and Kubernetes (scheduling, utilization, isolation)
  • Designing and running data / compute pipelines and orchestration (e.g., Airflow, Argo) with object storage (Azure Blob / S3)
  • Platform reliability and operability: observability, metrics, logging, tracing, alerting (Prometheus, Grafana, Open Telemetry)

Desired skills

  • Building secure, reproducible platforms using CI/CD, infrastructure‑as‑code (Terraform, Helm), container security, and secrets management
  • Experience working closely with AI researchers in fast‑moving, experimental, frontier‑scale research environments and building internal tools (CLIs, portals, APIs) to boost productivity

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

전체 조회수

0

전체 지원 클릭

0

전체 Mock Apply

0

전체 스크랩

0

Microsoft 소개

Microsoft

Microsoft

Public

Microsoft Corporation is an American multinational technology conglomerate headquartered in Redmond, Washington.

10,001+

직원 수

Redmond

본사 위치

$3000B

기업 가치

리뷰

10개 리뷰

4.4

10개 리뷰

워라밸

3.2

보상

4.1

문화

4.3

커리어

3.8

경영진

4.0

82%

지인 추천률

장점

Cutting-edge technology and innovative projects

Great team culture and collaborative atmosphere

Excellent benefits and competitive compensation

단점

Heavy workload and frequent overtime

High expectations and stressful environment

Bureaucratic processes can be slow

연봉 정보

5,620개 데이터

Senior/L5

Senior/L5 · Account Management

5개 리포트

$209,483

총 연봉

기본급

$181,941

주식

-

보너스

-

$194,895

$209,483

면접 후기

후기 1개

난이도

4.0

/ 5

소요 기간

14-28주

경험

긍정 0%

보통 0%

부정 100%

면접 과정

1

Application Review

2

Recruiter Screen

3

Technical Phone Screen

4

Onsite/Virtual Interviews

5

Team Matching

6

Offer

자주 나오는 질문

Coding/Algorithm

System Design

Behavioral/STAR

Technical Knowledge