Jobs
Required Skills
Python
Kafka
Spark
Overview
JOB DESCRIPTION
Modern ads platforms run on always-on, real-time data: streaming events, feature computation, near-real-time aggregations, and low-latency serving to power ML models that operate at massive scale under strict freshness, cost, and reliability requirements.
Microsoft Ads builds and operates large-scale, latency-sensitive systems that serve billions of requests. We are looking for a Sr Software Engineer who is hands-on with production coding and system design to build the real-time data pipelines and feature/embedding materialization systems that feed online stores/caches and integrate tightly with ML inference serving.
This role is ideal for engineers who enjoy:
- building robust streaming + ETL systems (correctness, idempotency, backfills, late data),
- owning SLOs with strong observability and operational maturity,
- and optimizing end-to-end performance and cost across compute, storage, and serving integrations.
Primary success metrics are freshness, correctness, latency, reliability, and cost in production.
Responsibilities:
Responsibilities:
- Design and implement real-time streaming ETL / feature pipelines (e.g., Flink or Spark Structured Streaming) that meet strict freshness and correctness constraints.
- Build and operate reliable messaging and ingestion with Kafka/Pulsar (partitioning strategy, retries, ordering guarantees, DLQs, backpressure handling).
- Own data contracts between producers, pipelines, and consumers: schema evolution, versioning, compatibility, validation, and safe rollout.
- Implement production-grade backfill/replay workflows
- Define and meet SLOs using Open Telemetry/Prometheus/Grafana for metrics, tracing, dashboards, alerting, and incident response readiness.
- Integrate pipelines with online stores/caches and ML consumers (feature stores, embedding pipelines, LLM API calls, online/offline consistency patterns).
- Partner with applied scientists on feature/embedding definitions, validation, and end-to-end quality measurement.
- Optimize end-to-end performance and efficiency: CPU/memory/I/O, serialization, caching, network overhead, concurrency, and pipeline compute cost.
- Contribute to serving/inference integrations where needed (e.g., Triton/ONNX Runtime/TensorRT) including batching and latency/cost tradeoffs.
- Ship safely with CI/CD, automated testing (unit/integration/data quality), and operational playbooks/runbooks.
Qualifications:
Required Qualifications
- Bachelor’s or Master’s degree in Computer Science, Electrical/Computer Engineering, or a related field, with 6+ years of related experience.
- Strong programming skills in language C++,C# or Python (at least one required).
- Hands-on experience in one or more:
- Building and operating streaming data pipelines in production (Flink or Spark Structured Streaming),
- Distributed systems engineering with strong reliability and operational rigor,
- Messaging systems such as Kafka/Pulsar.
- Experience operating services with Kubernetes/containers and production readiness practices (deployments, scaling, rollbacks).
- Experience with observability stacks such as Open Telemetry, Prometheus, Grafana.
Preferred Qualifications:
- Experience with feature stores, embedding pipelines, and online/offline consistency (freshness guarantees, correctness validation).
- Experience with data lakehouse/table formats and optimizations eg partitioning, compaction, and incremental processing.
- Experience with GPU inference serving (Triton, ONNX Runtime/TensorRT) and performance techniques (batching, request shaping, tail-latency reduction).
- Background in cost/performance modeling, capacity planning, and reliability improvements for high-scale data platforms.
- Experience in Ads/search/recommendations or other high-scale systems where freshness, latency, and cost are important
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Principal Engineer Software
Palo Alto Networks · Bangalore, India

Sr Structural Analyst - New Glenn
Blue Origin · Greater Seattle Area; Space Coast

R&D Product Development Engineer
Samsung · 19 Chapin Road, Pine Brook, NJ, USA
HP
Senior Software Engineer - Routing Protocol Developer
HPE · Bangalore, Karnātaka, India

Staff Software Engineer
Northrop Grumman · United States-Alabama-Huntsville
About Microsoft
Reviews
3.8
5 reviews
Work Life Balance
4.1
Compensation
4.3
Culture
3.4
Career
3.2
Management
3.0
65%
Recommend to a Friend
Pros
Excellent compensation and benefits package
Four-day workweek with improved work-life balance
Supportive managers and teams
Cons
High-pressure environment causing anxiety
Unprofessional interview processes
Limited creative work opportunities
Salary Ranges
5,571 data points
Junior/L3
Mid/L4
Junior/L3 · Advertising Client Success
2 reports
$163,358
total / year
Base
$141,875
Stock
-
Bonus
-
$163,358
$163,358
Interview Experience
7 interviews
Difficulty
3.7
/ 5
Duration
14-28 weeks
Offer Rate
14%
Experience
Positive 14%
Neutral 29%
Negative 57%
Interview Process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Technical Interview
5
Onsite/Virtual Interviews
6
Final Round
7
Offer
Common Questions
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Past Experience
News & Buzz
Microsoft loses $400 billion in few hours, what's behind one of the worst stock market days for the compa - Times of India
Source: Times of India
News
·
7w ago
Microsoft Stock Tumbles 12.1% In Worst Day For Company In Years - HuffPost
Source: HuffPost
News
·
7w ago
Microsoft: The 'question' the company needs to answer - Yahoo Finance
Source: Yahoo Finance
News
·
7w ago
AI is a planet-sized bubble — and Microsoft's slump is a taste of the crash to come, tech guru Erik Gordon says - Business Insider
Source: Business Insider
News
·
7w ago
