채용
Sr Software Development Engineer - Silicon Development Infrastructure , ML Silicon Infrastructure

Sr Software Development Engineer - Silicon Development Infrastructure , ML Silicon Infrastructure
Austin, TX, USA
·
On-site
·
Full-time
·
1w ago
We're seeking a Senior Silicon Software Development Infrastructure Engineer to architect, build and operate the infrastructure that accelerates silicon development at Annapurna Labs. In this role, you'll design and deliver the platforms, tooling, and automation that enable our chip design teams to iterate faster, validate more thoroughly, and bring transformative silicon to market. You'll work at the intersection of cloud infrastructure, high-performance computing, and electronic design automation—building systems that directly impact AWS's ability to innovate in custom silicon.
This is a unique opportunity to shape infrastructure that supports chip development while working with world-class engineers across hardware, software, and operations disciplines.
Key job responsibilities
Customer-Obsessed Infrastructure Development:
- Partner directly with silicon design, verification, emulation, formal verification, and software teams to deeply understand their development workflows, pain points, and iteration cycles.
- Build customer-facing tooling including command-line interfaces, REST APIs, and automation services that eliminate manual toil and reduce time-to-results
- Gather continuous feedback from internal customers and rapidly iterate on solutions. Benchmark infrastructure based on silicon development workflows to provide internal customers with the optimal resources for silicon development.
Own End-to-End Platform Delivery:
- Design, implement, and operate cloud infrastructure (AWS preferred) and high-performance computing clusters using schedulers like Slurm
- Build and maintain CI/CD pipelines for infrastructure-as-code, container images, service deployments, and cluster configuration changes with comprehensive testing, staged rollouts, and safe rollback mechanisms
- Take full ownership of platform reliability, performance, and cost efficiency—from initial design through production operation and continuous improvement
Deliver Results Through Automation and Observability:
- Develop data pipelines that ingest metrics, logs, and workflow results from distributed systems
- Design and operate databases that capture workflow metadata, job outcomes, and resource utilization patterns
- Build dashboards and alerting systems that surface actionable insights on efficiency, utilization, reliability, and cost trends
- Establish monitoring, incident response processes, runbooks, and documentation that enable operational excellence
Invent and Simplify Systems:
- Identify opportunities to simplify workflows and reduce complexity in silicon development infrastructure
- Design pragmatic, scalable solutions that balance immediate needs with long-term maintainability
- Challenge assumptions and propose innovative approaches to infrastructure problems, always asking "what's the simplest thing that could work?"
- Build reusable abstractions and platforms that eliminate repetitive work across multiple teams and chip programs
A day in the life
As a Senior Software Development Engineer at Annapurna Labs working on silicon infrastructure, you'll start your day investigating any anomalies in job completion rates or resource utilization. You might spend your morning collaborating with a design verification team to optimize their regression workflows, identifying bottlenecks in their CI pipeline and proposing architectural improvements that could reduce iteration time by hours.
In the afternoon, you could be designing a new API that abstracts license server complexity for emulation teams, or implementing autoscaling policies for a Slurm cluster that needs to handle unpredictable workload spikes while optimizing costs. You'll participate in architecture reviews, contribute to postmortems when incidents occur, and continuously refine the observability dashboards that give teams real-time visibility into their development velocity.
Throughout the day, you'll balance immediate customer needs—unblocking a team waiting for compute capacity—with longer-term platform investments that will scale across multiple chip programs. You'll write code, review infrastructure-as-code changes, and collaborate across time zones with engineers who depend on the systems you build. Every improvement you deliver directly accelerates the path from RTL to silicon.
About the team
At Annapurna Labs, your infrastructure work directly enables breakthrough innovations in custom silicon that power AWS and transform industries. You'll collaborate with world-class chip designers, verification engineers, and software developers who are pushing the boundaries of what's possible. We offer the resources and scale of AWS with the innovation culture and technical depth of a focused silicon team.
If you're passionate about building infrastructure that accelerates innovation, thrive on customer obsession and ownership, and want to see your work enable the next generation of AWS silicon—we want to hear from you.
Basic Qualifications
- 5+ years of professional software and systems development experience
- Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent work experience
- Strong programming skills in Python or similar languages with demonstrated software engineering best practices
- Familiarity with semiconductor development workflows and electronic design automation (EDA) tools in domains such as design verification, physical design, emulation, or formal verification
- Experience designing, building, and operating cloud infrastructure with infrastructure-as-code methodologies
- Solid understanding of networking, security, performance optimization, and distributed systems fundamentals
- Experience with CI/CD systems such as Jenkins, GitLab CI, or similar platforms
- Clear communication skills with ability to explain technical tradeoffs, propose solutions, and collaborate effectively across teams
Preferred Qualifications
- Experience with operating system-level debugging and performance optimization, including NUMA node configuration, memory topology tuning, and system resource allocation strategies.
- Experience operating AWS cloud environments at scale with deep knowledge of EC2, VPC, IAM, and related services
- Experience designing and operating high-performance computing (HPC) or high-throughput computing (HTC) clusters with workload schedulers like Slurm
- Hands-on experience with backend systems including message queues, caching layers, artifact repositories, or internal service platforms
- Knowledge of enterprise authentication systems such as Entra ID, LDAP, FreeIPA, or SSSD
- Experience with high-performance storage architectures and optimizing data movement for large-scale workloads
- Familiarity with license server management for capacity-constrained or expensive commercial toolchains
- Track record of driving operational excellence through monitoring, incident response, and continuous improvement
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications, and location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off, and parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits.
USA, TX, Austin - 168,100.00 - 227,400.00 USD annually
총 조회수
0
총 지원 클릭 수
0
모의 지원자 수
0
스크랩
0
비슷한 채용공고

Senior Software Engineer - Cloud Infrastructure, Golang
Apple · Austin, TX

Sr Software Engineer
PayPal · Austin, Texas, United States of America; San Jose, California, United States of America

AI Solution Principal Systems Development Engineer
Dell · Austin, Texas, United States

Sr. Software Engineer - Video Apps
Apple · Austin, TX

Staff Engineer, Emulation Technical Lead
Tenstorrent · Austin, Texas, United States
Amazon 소개

Amazon
PublicAmazon.com, Inc. is an American multinational technology company engaged in e-commerce, cloud computing, online advertising, digital streaming, and artificial intelligence.
10,001+
직원 수
Seattle
본사 위치
$1.5T
기업 가치
리뷰
2.9
10개 리뷰
워라밸
2.8
보상
3.7
문화
2.5
커리어
2.3
경영진
2.1
35%
친구에게 추천
장점
Good pay and compensation
Strong benefits package
Flexible scheduling options
단점
Poor management and leadership
Limited growth and promotion opportunities
High stress and demanding work environment
연 봉 정보
4개 데이터
L2
L3
L4
L5
L6
L2 · Data Analyst L2
0개 리포트
$108,330
총 연봉
기본급
$43,332
주식
$54,165
보너스
$10,833
$75,831
$140,829
면접 경험
10개 면접
난이도
3.7
/ 5
소요 기간
21-35주
합격률
20%
경험
긍정 10%
보통 10%
부정 80%
면접 과정
1
Application Review
2
Recruiter Screen
3
Online Assessment
4
Technical Phone Screen
5
Onsite/Virtual Loop
6
Team Matching
7
Offer
자주 나오는 질문
Coding/Algorithm
System Design
Behavioral/STAR
Leadership Principles
Technical Knowledge
뉴스 & 버즈
Amazon vs. Walmart: This Isn't Even Close - The Motley Fool
The Motley Fool
News
·
2d ago
'Kevin' Review: Jason Schwartzman, Aubrey Plaza in Amazon Cat Cartoon - The Hollywood Reporter
The Hollywood Reporter
News
·
2d ago
Amazon's best weekend deals: Apple, Clinique, Yeti and more — save up to 70% - Yahoo
Yahoo
News
·
2d ago
Amazon Delivery Drones Involve a Perilous 10-Foot Drop. Users Are Posting the Apparent Results - Gizmodo
Gizmodo
News
·
2d ago