Job Description
From designing fault-tolerant architectures to leading incident responses you’ll have the freedom to shape how we deliver stable secure and high-performance banking services. We’re looking for a talented Site Reliability Engineer (SRE) to keep our systems running smoothly, reliably and at scale. Through smart automation, deep observability and a calm head in a crisis you’ll help us balance speed, compliance and stability while working alongside DevOps, Cloud, Quality Engineering and Product teams to drive continuous improvements in performance, security and resilience. You’ll play a key role in enhancing reliability, accelerating delivery and ensuring seamless digital experiences for our customers.
What You Will Be Doing- Define and implement SLIs, SLOs and error budgets for business‑critical digital banking services.
- Build actionable observability metrics, logs, traces, dashboards and alerts using Dynatrace, Prometheus, Grafana and ELK while reducing aler...