採用
福利厚生
•Healthcare
•Equity
必須スキル
SAP
Python
Excel
At Cadence, we hire and develop leaders and innovators who want to make an impact on the world of technology.
Job Summary:
The Data Center Operations Engineer is responsible for supporting, maintaining, and deploying critical data center infrastructure with a strong focus on
Linux-based systems, GPU server deployments, and Infini Band networking. This role requires hands-on expertise in data center operations, cluster bring-up, hardware installation, and troubleshooting across compute, network, and GPU environments. The engineer will collaborate closely with global infrastructure, development, and operations teams to ensure reliable, secure, and scalable service delivery.
-
Key Responsibilities
-
Provide hands-on operational support for all data center projects, deployments, and repair activities.
-
Participate in an on-call rotation and provide on-site or remote support during maintenance windows and incidents.
-
Troubleshoot and resolve operational issues related to Linux servers, GPU platforms, networking, and storage infrastructure.
-
Support customer and internal deployments, ensuring timely and successful bring-up of GPU servers and clusters.
-
Perform Infini Band fabric bring-up, switch configuration, subnet management, and troubleshooting.
-
Conduct daily health checks of Linux systems and infrastructure components, proactively identifying and mitigating risks.
-
Install, configure, test, and maintain server hardware (rack and stack, labeling, HDDs, memory, CPUs, RAID batteries, NICs, etc.).
-
Install, configure, and troubleshoot networking equipment including routers, switches, and terminal servers for out-of-band management.
-
Review and validate equipment deployments against approved design documentation and standards.
-
Support data center builds, refreshes, migrations, and expansions while adhering to quality and safety standards.
-
Coordinate with vendors and onsite staff for hardware delivery, diagnostics, replacement, and warranty services.
-
Utilize monitoring and alerting frameworks to identify issues, escalate appropriately, and ensure timely service restoration.
-
Maintain accurate documentation of operational procedures, system configurations, and runbooks.
-
Follow established incident management, escalation procedures, and service-level agreements (SLAs).
-
Collaborate with global teams across time zones to support operational initiatives and continuous improvement efforts.
-
Contribute to process improvement initiatives and ensure adherence to documented policies, processes, and procedures.
-
Required Qualifications
-
Bachelor’s degree in Computer Science, Engineering, Information Technology, or equivalent practical experience.
-
*Strong hands-on experience in Linux environments, including system administration, troubleshooting, and performance validation.
-
*Proficiency with Linux command-line tools and shell scripting (Bash or equivalent).
-
Experience with cluster bring-up, driver installation, and system-level configuration.
-
Hands-on experience setting up and validating GPU servers in clustered environments.
-
Experience with end-to-end GPU testing in Infini Band-based clusters.
-
Working knowledge of Infini Band networking, including switch configuration and subnet management.
-
Solid understanding of networking fundamentals, including the OSI model and TCP/IP protocol suite (IP, ARP, ICMP, TCP, UDP, SMTP, FTP, TFTP).
-
Experience installing, configuring, and troubleshooting routers, switches, and terminal servers.
-
Familiarity with fiber and copper cabling, including IP and SAN deployments.
-
Experience managing incident tickets, maintaining acceptable ticket loads, and meeting SLAs.
-
Strong organizational skills with meticulous attention to detail in data center environments.
-
Ability to follow and enforce documented escalation procedures and operational policies.
-
Strong verbal and written communication skills, with the ability to collaborate effectively with cross-functional and global teams.
-
Preferred Qualifications
-
Experience supporting HPC, AI, or large-scale GPU environments.
-
Exposure to data center monitoring
-
Experience documenting operational processes and maintaining technical runbooks.
-
Familiarity with large-scale data center buildouts or refresh programs.
-
Physical Requirements
-
Ability to perform the essential functions of the role, including lifting, moving, and installing equipment weighing 50 pounds or more, with or without reasonable accommodation.
-
Ability to work in data center environments, including raised floors, equipment racks, and confined spaces.
-
Willingness to work flexible hours, including nights, weekends, and on-call rotations as required.
-
Work Environment
-
On-site data center environment with occasional remote coordination.
-
Interaction with hardware vendors, service providers, and internal engineering teams.
-
Fast-paced operational setting requiring attention to detail, adherence to safety standards, and rapid problem resolution.
We’re doing work that matters. Help us solve what others can’t.
総閲覧数
2
応募クリック数
0
模擬応募者数
0
スクラップ
0
類似の求人

Service Management L2 Engineer
Nokia · United States, US

Turbomachinery Engineer II
Relativity Space · Long Beach, California

System Safety Engineer, Autonomy Trucking
Applied Intuition · Sunnyvale, California, United States

CPU Physical Design Pathfinding Engineer
Qualcomm · San Diego, California, United States of America

HICOM Sustainment Integrator
Booz Allen Hamilton · Fort Leavenworth, KS
Cadenceについて

Cadence
PublicCadence Design Systems provides electronic design automation (EDA) software, hardware, and IP for designing and verifying electronic systems and semiconductors.
5,001-10,000
従業員数
San Jose
本社所在地
$8.5B
企業価値
レビュー
4.0
10件のレビュー
ワークライフバランス
4.2
報酬
2.8
企業文化
4.1
キャリア
3.2
経営陣
3.4
72%
友人に勧める
良い点
Good work-life balance
Supportive and collaborative team environment
Flexible work arrangements
改善点
Below market compensation
Limited career advancement opportunities
Heavy workload and long hours
給与レンジ
58件のデータ
Junior/L3
Junior/L3 · Data Analyst
1件のレポート
$91,103
年収総額
基本給
$85,276
ストック
-
ボーナス
$5,827
$59,612
$139,984
面接体験
1件の面接
難易度
3.0
/ 5
期間
14-28週間
面接プロセス
1
Application Review
2
Recruiter Screen
3
Technical Phone Screen
4
Onsite/Virtual Interviews
5
Final Decision
よくある質問
Technical Knowledge
Behavioral/STAR
Past Experience
Problem Solving
ニュース&話題
Ninety One UK Ltd Cuts Position in Cadence Design Systems, Inc. $CDNS - MarketBeat
MarketBeat
News
·
3d ago
Moran Wealth Management LLC Sells 19,592 Shares of Cadence Design Systems, Inc. $CDNS - MarketBeat
MarketBeat
News
·
3d ago
Cadence Maps Its Future Beyond EDA With Agentic AI and Simulation - HPCwire
HPCwire
News
·
4d ago
Lesser-Known Cadence Design Systems Just Landed Google and Nvidia Deals. Should You Buy CDNS Stock? - Barchart.com
Barchart.com
News
·
4d ago