
Organizing the world's information and making it universally accessible.
Cloud Technical Solutions Engineer, Compute
福利待遇
•股权
•医疗保险
•Learning Budget
•弹性工作
•育儿假
必备技能
PostgreSQL
React
Python
About the job
The Google Cloud team helps companies, schools, and government seamlessly make the switch to Google products and supports them along the way. You listen to the customer and swiftly problem-solve technical issues to show how our products can make businesses more productive, collaborative, and innovative. You work closely with a cross-functional team of web developers and systems administrators, not to mention a variety of both regional and international customers. Your relationships with customers are crucial in helping Google grow its Cloud business and helping companies around the world innovate.
In this role, you will own customer issues and provide specialized support to other teams. You will be a part of a global team that provides support to ensure customers can deploy their Artificial Intelligence (AI) and Machine Learning (ML) workloads on AI Infrastructure products. You will troubleshoot technical problems with hardware and software debugging, networking, Linux system administration, coding/scripting, and updating documentation. You will help the customer’s success in the AI/ML space by making improvements to the product, internal tools, processes, and documentation. You will help drive business growth by recognizing and advocating for the customers’ tests related to AI deployments.
Responsibilities
-
Manage customer’s problems through diagnosis, resolution, or implementation of new investigation tools to increase productivity for customer issues on AI/ML infrastructure.
-
Develop an understanding of AI/ML workloads and underlying hardware architectures by troubleshooting, reproducing, determining the root cause for customer reported issues, and building tools for diagnosis.
-
Act as a consultant and subject matter expert for internal stakeholders in Engineering, Business, and customer organizations to resolve deployment and operational obstacles in AI infrastructure environments.
-
Work with multiple Product and Engineering teams to find ways to improve the product, and interact with our Site Reliability Engineering (SRE) teams to drive production.
-
Be available for non-standard work hours or shifts which may include weekends as needed.
Minimum qualifications
-
Bachelor’s degree in Science, Technology, Engineering, Mathematics, or equivalent practical experience.
-
6 years of experience with writing code in one or more general purpose programming languages (e.g., C++, Java, Python, Go, etc).
-
Experience with Linux/Unix systems with debugging issues across the hardware/software boundary on enterprise-grade server infrastructure.
-
Experience in troubleshooting for customer needs, and triaging technical issues across the stack (e.g., hardware faults, networking, virtualization, kernel drivers, firmware, performance).
Preferred qualifications
-
Experience in working with distributed systems with the knowledge of common solutions, design patterns, or best practices.
-
Experience in working with Artificial Intelligence/Machine Learning (AI/ML) computing hardware, including Graphics Processing Unit (GPUs) or other accelerators.
-
Experience with containerization and orchestration technologies like Kubernetes or Slurm.
-
Experience with ML frameworks (e.g., Tensor Flow, Pytorch), with the knowledge of the AI/ML training and inference lifecycle.
-
Excellent troubleshooting and communication skills with attention to details.
浏览量
0
申请点击
0
Mock Apply
0
收藏
0
相似职位

Frontend Software Engineer, ML Platform, Autopilot Infrastructure
Tesla · Palo Alto, California

Mobile App Engineer, Service & Roadside Assistance, Vehicle Software
Tesla · Palo Alto, California

Software Engineer, Mobile App, Vehicle Software
Tesla · Palo Alto, California

Frontend Software Engineer, Energy Residential
Tesla · Fremont, California

Senior Software Engineer – Golang (m/w/d) - Gigafactory Berlin-Brandenburg
Tesla · Grünheide (mark), Brandenburg
关于Google

Google specializes in internet-related services and products, including search, advertising, and software.
10,001+
员工数
Mountain View
总部位置
$1,700B
企业估值
评价
10条评价
4.5
10条评价
工作生活平衡
3.2
薪酬
4.3
企业文化
4.1
职业发展
4.2
管理层
3.8
82%
推荐率
优点
Great benefits and perks
Innovative and interesting work
Career development and learning opportunities
缺点
High pressure and expectations
Long hours and heavy workload
Fast-paced and overwhelming environment
薪资范围
57,503个数据点
Mid/L4
Mid/L4 · Accessibility Analyst
1份报告
$214,500
年薪总额
基本工资
$165,000
股票
-
奖金
-
$214,500
$214,500
面试评价
9条评价
难度
3.4
/ 5
时长
14-28周
录用率
44%
体验
正面 0%
中性 56%
负面 44%
面试流程
1
Application Review
2
Online Assessment/Technical Screen
3
Phone Screen
4
Onsite/Virtual Interviews
5
Team Matching
6
Offer
常见问题
Coding/Algorithm
System Design
Behavioral/STAR
Technical Knowledge
Product Sense
最新动态
Our eighth generation TPUs: two chips for the agentic era - blog.google
blog.google
News
·
1w ago
Google Maps on Android Auto now shows bigger labels on streets along your route [Gallery] - 9to5Google
9to5Google
News
·
1w ago
Google to invest up to $40 billion in AI rival Anthropic - Reuters
Reuters
News
·
1w ago
Google to invest up to $40B in Anthropic in cash and compute - TechCrunch
TechCrunch
News
·
1w ago