招聘
必备技能
Kubernetes
Terraform
Linux
Excel
SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologies to make this possible, with the ultimate goal of enabling human life on Mars.
SR.
LINUX SITE RELIABILITY ENGINEER:
SpaceX is looking for an experienced engineer with deep working knowledge of Kubernetes and related containerized technologies. This employee will be a member of the Information Technology Linux Infrastructure team and will provide expertise in Kubernetes design, maintenance, scaling and optimization in support of critical business functions. The ideal candidate will be flexible and flourish in a fast paced and challenging environment. They should be a self-starter, self-motivator and possess ingenuity to excel at this position.
RESPONSIBILITIES:
-
Install, manage, scale and optimize Kubernetes and RKE clusters using Ansible, Terraform and adjacent technologies in production environments.
-
Work closely with other SpaceX engineers to gather requirements, research, evaluate, design, plan, deploy, and support software platforms and related technologies running in Kubernetes within a world-class environment that meets the needs of the demanding SpaceX engineering teams. Build highly resilient, high-performance, scalable, and robust systems.
-
Exercise a high degree of personal responsibility for the processes, systems, and tools you create and manage; all supporting the goal of making humanity an interplanetary species.
-
Make recommendations, justify, and implement improvements using an accepted change control methodology.
-
Work within a diverse group to design and deliver creative solutions and resolve problems in a timely and proactive manner by interacting with internal business units.
-
Define, document and follow standards and best practices for systems design, testing, and implementation.
-
Foster an environment of collaboration and cross-training, upskilling the team in Kubernetes expertise and ensuring peers are developed into capable engineers.
-
Drive scripting, self-service and automation to develop solutions to reduce administrative overhead and TOIL.
-
Participate in on-call rotation to handle urgent after-hours work when necessary.
BASIC QUALIFICATIONS:
-
Bachelor’s degree in Computer Science or a STEM discipline and 5+ years of systems engineering experience; OR 7+ years of systems engineering experience in lieu of a degree.
-
Experience deploying and supporting Linux servers in physical and virtualized environments (e.g. VMware via automation).
-
Experience with the Linux shell as well as configuring and extending Linux instances (e.g. kernel modules, cgroups, pki, iptables, interfaces).
-
Experience supporting and scaling containerized applications in Linux environments.
-
Experience using automation frameworks (e.g. Ansible, Terraform) to manage provisioning and post-provisioning lifecycles of infrastructure and Kubernetes installations.
PREFERRED SKILLS AND EXPERIENCE:
-
Expertise in creating repeatable, reliable, scalable systems architectures, with high availability, fault tolerance, performance tuning, monitoring, and statistics/metrics collection.
-
Expertise in source code version control tools such as Git and Subversion and collaborating on source code via Pull Requests and other Git-based workflows.
-
Strong understanding of Linux Container Runtime.
-
Experience implementing configuration management provisioning and workflow automation solutions via Infrastructure as Code, CI/CD and Git Ops (e.g. Ansible, AWX/Tower, Vagrant, Puppet, Redfish, Jenkins, cloud-init, ArgoCD, etc).
-
Experience writing test automation to ensure backwards compatibility of feature and change development for automation processes and Kubernetes deployments.
-
Experience with programming and scripting languages such as Python and Golang to develop software solutions and integrate with external systems to implement automation against RESTful API services.
-
Experience installing, configuring and troubleshooting Kubernetes internals, CNI, CRI and CSI plugins (e.g. Docker, Cri-O, Ceph, Cilium), load balancing (e.g. MetalLB), Service Mesh (e.g. Istio) and software-defined storage (e.g. rook-ceph) in cloud or on-premise environments.
-
Experience developing solutions using Kubernetes patterns to extend system functionality and solve custom use cases (e.g. webhooks, controllers, operators, sidecars).
-
Experience implementing proactive alert/monitoring workflows and dashboards for Linux systems and Kubernetes deployments using Prometheus, Grafana, InfluxDB or similar technologies.
-
Experience with dynamic system configuration templating using Jinja, Jsonnet, YAML and Helm.
ADDITIONAL REQUIREMENTS:
- Must be willing to work extended hours and weekends as needed.
ITAR REQUIREMENTS:
- To conform to U.S. Government export regulations, applicant must be a (i) U.S. citizen or national, (ii) U.S. lawful, permanent resident (aka green card holder), (iii) Refugee under 8 U.S.C. § 1157, or (iv) Asylee under 8 U.S.C. § 1158, or be eligible to obtain the required authorizations from the U.S. Department of State. Learn more about the ITAR here.
SpaceX is an Equal Opportunity Employer; employment with SpaceX is governed on the basis of merit, competence and qualifications and will not be influenced in any manner by race, color, religion, gender, national origin/ethnicity, veteran status, disability status, age, sexual orientation, gender identity, marital status, mental or physical disability or any other legally protected status.
Applicants wishing to view a copy of SpaceX’s Affirmative Action Plan for veterans and individuals with disabilities, or applicants requiring reasonable accommodation to the application/interview process should reach out to EEOCompliance@spacex.com*. *
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位

Principal IT Analyst - Release Train Engineer SAP S/4 HANA
Honeywell · Charlotte, NC, United States, US

Site Reliability Engineer III
F5 Networks · 3 Locations

Senior Security DevOps Engineer
Apple · San Diego, CA

Senior Site Reliability Engineer
Coalition · Any location, United States

Software Engineer Graduate (Site Reliability Engineering) - 2026 Start (BS/MS)
TikTok · San Jose, CA
关于SpaceX

SpaceX
Late StageSpace Exploration Technologies Corp., more commonly known as SpaceX, is a private American aerospace and artificial intelligence company headquartered at the Starbase development site in Starbase, Texas.
13,000+
员工数
Hawthorne
总部位置
$180B
企业估值
评价
4.1
10条评价
工作生活平衡
2.3
薪酬
3.8
企业文化
4.2
职业发展
4.0
管理层
3.5
75%
推荐给朋友
优点
Innovative projects and cutting-edge technology
Strong mission and sense of purpose
Great team and collaborative environment
缺点
Poor work-life balance
Long hours required
High pressure and stress
薪资范围
108个数据点
Junior/L3
Junior/L3 · Integration Technician
106份报告
$80,528
年薪总额
基本工资
$68,316
股票
$12,212
奖金
-
$53,612
$122,653
面试经验
5次面试
难度
3.4
/ 5
时长
21-35周
体验
正面 0%
中性 60%
负面 40%
面试流程
1
Application Review
2
Phone Screen
3
Technical Interview/Live Coding
4
Onsite/Virtual Interviews
5
Technical Presentation
6
Final Round
常见问题
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
System Design
Past Experience
新闻动态
Elon Musk has used SpaceX as a kind of piggy bank over the last two decades, turning to the company as a financial tool to get loans and bolster his struggling companies, according to an examination by The New York Times based on corporate filings, intern - facebook.com
facebook.com
News
·
2d ago
Breakingviews - SpaceX’s market claims are planet-scale absurdity - Reuters
Reuters
News
·
2d ago
RIAs Face QQQ Risks Amid SpaceX IPO, Nasdaq Rule Changes - Wealth Management
Wealth Management
News
·
2d ago
SpaceX drone ship retires, what it means for launches - Florida Today
Florida Today
News
·
2d ago