採用
- IND - Staff Engineer, Reliability
- GCC070
We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future.
Cloud Services Team is searching for a Reliability Engineer. Candidate must have hands-on experience operating and engineering services on Google Cloud Platform (GCP), including data, compute, and observability services. The team is accountable for the operations, engineering, and governance of 200+ Cloud Technologies across a multiple cloud environment. Role requires helping mature operational practices for GCP workloads as part of our multi-cloud strategy. This is an excellent opportunity for someone who is interested in a mix of strategy and hands-on work. The ideal candidate should feel comfortable working with teammates at all levels of the organization including leadership.
Key Responsibilities
-
Assists in the development, maintenance and operations of IT services across 200+ infra services across our Cloud transformation landscape.
-
Develop solutions and drive adoption of enterprise solutions such as Cyber Protection, Disaster Recovery, and Security enhancements, across Line of business teams.
-
Drive improvement, through automation, of software delivered as a service from an efficiency and simplicity perspective.
-
Provide clear operational documents and construction/support specifications to IT userbase.
-
Provide insight into operational Metrics across the entire Cloud Environment.
-
Consult with customers on any new requirements or design questions or functionality configurations for environments on and off premise
-
Delivers the tooling and capabilities needed to enable cloud compliance, metrics and reporting and cost management roadmap and strategy.
-
Participate in incident resolution and change implementation as necessary. This may occasionally include support during non standard hours.
-
Operate and improve reliability for production workloads running on Google Cloud Platform (GCP), focusing on availability, scalability, and operational readiness rather than application development.
-
Own day‑to‑day operational concerns for core GCP services including Compute Engine, GKE, Cloud Run, Big Query, Cloud Storage, and supporting platform services.
-
Provide operational support for Big Query platforms including job performance troubleshooting, capacity planning, quota management, dataset permissions, and cost optimization (slot usage, reservations, and quotas).
-
Support Vertex AI platforms from an operations and reliability standpoint, including environment readiness, access controls, monitoring, pipeline execution health, and incident response (not model development).
-
Build and maintain observability standards using Cloud Monitoring, Cloud Logging, Error Reporting, and custom SLI/SLO dashboards for GCP workloads.
-
Implement alerting strategies aligned to error budgets and production reliability goals; reduce alert noise and prevent toil.
-
Execute incident response, triage, and post‑incident analysis for GCP services, contributing to PIRs and corrective actions.
-
Develop and maintain runbooks, operational playbooks, and escalation workflows for GCP services.
-
Drive automation-first operations, including self‑healing patterns using Cloud Functions, Cloud Run jobs, Scheduler, and event‑driven remediation.
-
Enforce and operate GCP security and governance controls, including IAM, service accounts, Org Policies, VPC Service Controls, KMS, Secret Manager, and networking guardrails.
-
Partner with engineering and data teams to review designs for operability, resiliency, and supportability, ensuring workloads meet production readiness standards before launch.
Required Skills & Experience:
-
Expert understanding of how applications should be engineered by following fault tolerate best practices, separation of duties, observability, and being operator friendly.
-
Expert on being Self-motivated and results-oriented with the ability to work in a team environment and independently
-
Strong hands-on experience with Big Query, including performance tuning, cost management, and governance.
-
Experience with Vertex AI, including pipelines, model deployment, model monitoring, and integration with Big Query.
-
Deep knowledge of Cloud IAM, service accounts, Workload Identity Federation, and principle-of-least-privilege controls.
-
Experience with GKE operations (clusters, node pools, autoscaling, workload identity, Istio/Anthos optional).
-
Understanding of Cloud Storage, Pub/Sub, Dataflow, Dataproc, and Cloud Composer for data/ML workflows.
-
Experience building CI/CD pipelines targeting GCP using Cloud Build, Artifact Registry, and Terraform.
-
Ability to troubleshoot GCP networking: VPCs, firewall rules, private service access, interconnects/VPN.
Nice to Have
-
Intermediate knowledge of Terraform and Cloud Formation required.
-
Intermediate Microsoft office skills
-
Hands-on experience with advanced GCP services such as Vertex AI, Big Query, Dataflow, Pub/Sub, Cloud Run, and GKE.
-
Experience creating org-level policies, security baselines, and automation patterns for GCP environments
What We Offer
-
Collaborative work environment with global teams.
-
Competitive compensation and comprehensive benefits.
-
Continuous learning and growth opportunities in geospatial and risk analytics technologies.
総閲覧数
0
応募クリック数
0
模擬応募者数
0
スクラップ
0
類似の求人
Senior Software Engineer
NetApp · Bangalore, India Office (BANGALORE)

Senior Site Reliability Engineer
Calendly · Remote - US

Cloud Native Senior DevOps Engineer - Digital Manufacturing (8+ Years)
SAP ·

Senior Software Developer (Mobile)
Warner Bros. Discovery · Kanata 307 Legget Dr.

Cloud Automation and Innovation - IT Architecture Senior Specialist - Encryption standards, Tooling
SAP ·
Hartfordについて

Hartford
BootstrappedThe Hartford Insurance Group, Inc., known as The Hartford, is a U.S.-based insurance company. The Hartford is a Fortune 500 company headquartered in its namesake city of Hartford, Connecticut. It was ranked 162nd in Fortune 500 in 2024.
51-200
従業員数
Paris
本社所在地
レビュー
3.7
10件のレビュー
ワークライフバランス
4.2
報酬
2.3
企業文化
4.1
キャリア
2.8
経営陣
3.2
68%
友人に勧める
良い点
Good work-life balance and flexible hours
Strong team culture and supportive colleagues
Excellent health benefits and vacation time
改善点
Non-competitive salary and pay
Limited career advancement and growth opportunities
Poor communication from upper management
給与レンジ
59件のデータ
Junior/L3
Mid/L4
Senior/L5
Director
Junior/L3 · Business Intelligence Developer
1件のレポート
$95,082
年収総額
基本給
$82,680
ストック
-
ボーナス
-
$95,082
$95,082
面接体験
3件の面接
難易度
3.3
/ 5
期間
14-28週間
体験
ポジティブ 0%
普通 67%
ネガティブ 33%
面接プロセス
1
Phone Interview
2
Video Interview
3
Analyst Interview
4
Trader Interview
5
Vice President Interview
ニュース&話題
Water main break, highway closed: Connecticut Department of Transportation - fox61.com
fox61.com
News
·
3d ago
Volunteers provide free home repairs for veterans in East Hartford, Meriden - WFSB
WFSB
News
·
3d ago
Hartford Athletic Look To Bounce Back Against Loudoun United - The Blazing Musket
The Blazing Musket
News
·
3d ago
Water main break closes highway in Hartford Saturday morning - NBC Connecticut
NBC Connecticut
News
·
3d ago