채용
We are seeking a Site Reliability Engineer (SRE) with strong Database Administration (DBA) skills to ensure the reliability, performance, and scalability of our infrastructure and data platforms. You will work across engineering, operations, and data teams to build resilient systems, automate operations, and maintain mission‑critical databases. You'll create standardized CI/CD frameworks that empower development teams while providing hands-on support to troubleshoot and resolve their build and deployment issues.
This role is ideal for someone who enjoys solving distributed‑systems challenges while also diving deep into database internals, performance tuning, and data reliability.
- Experience: 8+ years in SRE/Platform/DevOps/Operations roles with ownership of production systems at scale.
- Cloud: Hands-on with AWS/Azure/GCP (preferably two); strong grasp of managed services trade-offs.
- Containers & Orchestration:Docker and Kubernetes (AKS/EKS/GKE); Helm/Kustomize; service mesh familiarity (Istio).
- Observability: Open Telemetry; metrics/logs/traces design; alerting strategies; RCA & postmortems.
- Infrastructure as Code:Terraform (preferred) or Cloud-native equivalents; modules, remote state, and CI integration.
- Programming & Scripting: Proficiency in Python/Go and Bash for automation, tooling, and APIs.
- Reliability Practices: SLO/error budgets, capacity planning, chaos/resilience testing, progressive delivery.
- Soft Skills: Calm under pressure, strong communication, pragmatic decision-making, and a continuous improvement mindset
- Understanding of networking fundamentals (DNS, load balancing, TCP/IP)
- Expertise in designing and developing reusable CI/CD pipeline templates
- Proficiency with at least two CI/CD platforms (Atlassian, Azure DevOps, Jenkins, GitHub Actions, GitLab CI)
- Strong experience with Docker and Kubernetes
- Infrastructure as Code skills (Terraform, ARM templates, or CloudFormation)
- Cloud platform expertise (Azure, AWS, or GCP)
- Experience troubleshooting build and deployment issues across multiple technology stacks
- Strong Git and version control workflow knowledge
- Experience with automated testing frameworks (.NET: x Unit/NUnit, Python: pytest)
- Artifact and package management (Nu Get, PyPI, Azure Artifacts, Artifactory)
- Scripting skills (PowerShell, Bash, Python)
Good to Have
-
Experience with additional programming languages (Java, Node.js, Go)
-
Knowledge of frontend frameworks (React, Angular, Vue.js)
-
Git Ops implementation experience (ArgoCD, Flux)
-
Service mesh technologies (Istio, Linkerd)
-
Advanced deployment strategies (blue-green, canary, feature flags)
-
Database CI/CD and migration automation (Entity Framework, Flyway, Liquibase)
-
Security scanning tools integration (Sonar Qube, OWASP, Snyk)
-
Monitoring and observability tools (Prometheus, Grafana, ELK, Application Insights)
-
Configuration management (Ansible, Chef, Puppet)
-
Multi-cloud or hybrid cloud deployment experience
-
Experience building internal developer platforms
-
Creating CLI tools or IDE extensions for developer productivity
-
Policy-as-code implementation (OPA, Sentinel)
-
Cloud certifications (Azure, AWS, or GCP)
-
Kubernetes certifications (CKA, CKAD)
-
Experience with monorepo tools (Nx, Turborepo, Bazel)
-
API gateway and microservices architecture experience
-
Reliability Engineering
-
Define and manage service SLOs/SLIs, track error budgets, and drive reliability roadmaps.
-
Proactively identify reliability bottlenecks, lead remediation, and preventative actions.
-
Establish CI/CD best practices and standards across the organization
-
Observability & Telemetry
-
Implement and scale metrics, logs, and traces across services (e.g., Prometheus/Grafana, Open Telemetry, Dynatrace/Azure Monitor, ELK).
-
Build actionable dashboards and alerts with noise reduction and runbooks for on-call.
-
Incident Management
-
Own on-call rotations, triage, and coordination; drive post-incident reviews and blameless RCA with clear corrective actions.
-
Automate rollback/roll-forward, health checks, and verification steps.
-
Performance & Capacity
-
Conduct load and resilience testing; manage capacity planning and cost optimization (autoscaling, right-sizing, caching).
-
Tune databases, queues, and network settings for throughput and latency.
-
Automation & Tooling
-
Reduce toil with automation and self-service tooling; standardize deployment and recovery procedures.
-
Build reliability guardrails (chaos experiments, circuit breakers, rate limiting, backoff).
-
Platform & Infrastructure
-
Operate and harden Kubernetes clusters, container runtimes, and service meshes.
-
Manage infrastructure using Infrastructure as Code (IaC) - (Terraform/CloudFormation/Bicep), secrets management, and policy-as-code.
-
Security & Compliance
-
Implement Dev Sec Ops practices: vulnerability management, dependency scanning, Identity and Access Management (IAM) hardening.
-
Collaboration
-
Partner with developers, QA, and product on design reviews, release strategies, and production readiness.
-
Document standards and provide enablement sessions to elevate reliability practices.
-
Create comprehensive documentation and self-service guides.
-
**Tools & Technologies **
-
Cloud: Azure
-
Containers & K8s: Docker, AKS/EKS, Helm, Istio
-
Observability: Open Telemetry, Prometheus/Grafana, Azure Monitor/Log Analytics, Dynatrace, Elastic
-
CI/CD: GitHub Actions or Azure DevOps Pipelines; canary/blue-green deployments
-
IaC & Config: Terraform/Terragrunt, Bicep, Vault/Azure Key Vault, SSM
-
Security: Dependabot; Cosign; OPA
Total Views
0
Apply Clicks
0
Mock Applicants
0
Scraps
0
Similar Jobs

Senior Software Engineer - Xbox
Microsoft · United States, Washington, Redmond
NE
Senior Software Engineer - Free BSD/C++
NetApp · RTP, North Carolina, USA Office (NOCAROLINA)

Senior Engineer-DEG - Analog Layout
Micron · Hyderabad - Phoenix Aquila, India

Principal Software Engineer, Amazon DynamoDB
Amazon · Dublin, D, IRL

Senior Staff Software Engineer Multiple Locations
Intuit · mountain view
About Honeywell

Honeywell
PublicThe future is what we make it.
10000+
Employees
Charlotte
Headquarters
Reviews
3.2
4 reviews
Work Life Balance
3.5
Compensation
4.0
Culture
4.0
Career
3.0
Management
2.5
Pros
Good team and helpful colleagues
Fair pay and good benefits
Training and resources available
Cons
Limited job progression
Old boys club culture
High expectations with unclear answers
Salary Ranges
1,391 data points
Mid/L4
Senior/L5
Mid/L4 · Data Analyst II
2 reports
$136,600
total / year
Base
$105,077
Stock
-
Bonus
-
$136,600
$136,600
Interview Experience
4 interviews
Difficulty
2.5
/ 5
Duration
14-28 weeks
Offer Rate
25%
Experience
Positive 0%
Neutral 75%
Negative 25%
Interview Process
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Hiring Manager Interview
5
Panel Interview
6
Online Assessment
7
Offer
Common Questions
Technical Knowledge
Behavioral/STAR
Past Experience
Coding/Algorithm
Culture Fit
News & Buzz
Honeywell’s 2026 Earnings Outlook: Growth Amid Transition - TipRanks
Source: TipRanks
News
·
5w ago
Jim Cramer Is Enthusiastic About Honeywell’s (HON) Quantum Spinoff - Insider Monkey
Source: Insider Monkey
News
·
5w ago
DOL warns pro-plaintiff ruling in Honeywell case could threaten 401(k) matches - Pensions & Investments
Source: Pensions & Investments
News
·
5w ago
Building automation helps lead Honeywell sales growth - Facilities Dive
Source: Facilities Dive
News
·
5w ago