招聘
͏
Key Responsibilities:
· Design, deploy, and manage a highly available, distributed LGTM stack on Kubernetes.
· Operate and optimize Grafana, Loki, Tempo, and Mimir/Prometheus at scale.
· Implement observability best practices (metrics, logs, traces) for microservices and cloud-native applications.
· Build and maintain Helm charts, GitOps/AzureDevOps workflows for observability services.
· Ensure high availability, disaster recovery, scaling strategies, and performance tuning of observability components.
· Manage storage backends (object storage such as S3/GCS/Azure Blob) for logs, metrics, and traces.
· Configure alerting strategies using Prometheus Alertmanager and Grafana Alerting.
· Define and enforce SLOs, SLIs, and monitoring standards across engineering teams.
· Support onboarding of application teams to observability tooling.
· Troubleshoot complex distributed systems issues across Kubernetes and observability pipelines.
· Implement security best practices (RBAC, TLS, network policies, secrets management).
· Automate operational processes using Infrastructure as Code (Terraform, Helm, etc.).
· Monitor system capacity, optimize cost, and ensure efficient resource utilization.
· Maintain documentation and provide knowledge sharing sessions for internal teams.
Required Skills & Experience:
Technical Skills:
· Strong hands-on experience managing LGTM stack:
o Grafana (dashboards, alerting, RBAC, multi-tenancy)
o Loki (log ingestion, indexing, retention, scaling)
o Tempo (distributed tracing, sampling strategies)
o Mimir or Prometheus (remote write, federation, scaling, HA)
· Solid experience with Kubernetes (cluster operations, networking, storage, RBAC).
· Experience deploying distributed systems using Helm, GitOps/AzureDevOps tools
· Knowledge of PromQL and LogQL.
· Experience with object storage systems (S3-compatible, GCS, Azure Blob).
· Familiarity with Open Telemetry and instrumentation standards.
· Experience configuring and tuning Alertmanager.
· Understanding of microservices architecture and cloud-native patterns.
· Experience with CI/CD pipelines.
· Scripting skills (Bash, Python, or Go).
· Familiarity with cloud platforms (AWS, GCP, and Azure).
Soft Skills:
· Strong problem-solving and analytical skills.
· Ability to work cross-functionally with engineering and platform teams.
· Clear communication and documentation skills.
· Proactive mindset with a focus on reliability and automation
͏
总浏览量
0
申请点击数
0
模拟申请者数
0
收藏
0
相似职位
关于Wipro

Wipro
PublicA technology services and consulting company focused on building solutions that address clients' digital transformation needs.
10,001+
员工数
Bengaluru
总部位置
$8.5B
企业估值
评价
3.1
10条评价
工作生活平衡
3.5
薪酬
2.3
企业文化
3.8
职业发展
2.5
管理层
2.2
45%
推荐给朋友
优点
Good training and learning opportunities
Flexible work hours and remote options
Supportive colleagues and teamwork
缺点
Low and uncompetitive compensation
Limited growth and career advancement opportunities
Poor management direction and support
薪资范围
41,395个数据点
Mid/L4
Mid/L4 · Analyst - Business Process L2
1份报告
$128,283
年薪总额
基本工资
$111,550
股票
-
奖金
-
$128,283
$128,283
面试经验
5次面试
难度
2.0
/ 5
时长
14-28周
录用率
40%
体验
正面 100%
中性 0%
负面 0%
面试流程
1
Application Review
2
Online Assessment/Aptitude Test
3
Technical Interview
4
HR Interview
5
Offer
常见问题
Coding/Algorithm
Technical Knowledge
Behavioral/STAR
Past Experience
Culture Fit
新闻动态
Wipro share buyback, target prices: What Jefferies, Morgan Stanley, others say after soft Q1 guidance - MSN
MSN
News
·
4d ago
Wipro attrition falls to 13.8%, headcount inches up by 136 - The Economic Times
The Economic Times
News
·
5d ago
Wipro shares slide up to 4% after weak Q4, muted outlook dents sentiment - The Times of India
The Times of India
News
·
5d ago
Indian shares rise on peace deal hopes; Wipro, HDFC Life cap gains - TradingView — Track All Markets
TradingView — Track All Markets
News
·
5d ago



