Job Description
Overview
We are seeking a highly skilled DevOps Cloud Engineer with strong experience in cloud infrastructure, CI/CD automation, containerization, and system monitoring.
Responsibilities- Define and implement SLIs, SLOs, and error budgets for business‑critical digital banking services.
- Build actionable observability metrics, logs, traces, dashboards, and alerts using Dynatrace, Prometheus, Grafana, and ELK while reducing alert fatigue.
- Leverage AI‑driven insights and anomaly detection from Dynatrace Davis AI or equivalent AIOps platform to proactively predict and resolve reliability issues before impact.
- Lead incident management from on‑call triage and root‑cause analysis to blameless postmortems with actionable follow‑ups.
- Improve deployment safety with robust rollout, rollback strategies, canary and blue‑green deployments, and production readiness reviews.
- Support and optimize microservices‑based architectures, ...