Job Description
Impact:
- Own the mainframe automation strategy and roadmap to improve reliability, incident detection, and recovery.
- Scale automated solutions across infrastructure domains (batch, storage, networking, middleware) using APIs, orchestration, and infrastructure-as-code.
- Architect and govern multi-site failover automation; maintain and test DR playbooks and runbooks.
- Define and operationalize SLOs/SLIs, error budgets, and alerting standards; reduce MTTA/MTTR through event correlation and automated remediation.
- Instill disciplined engineering practices: peer reviews, version control, change management gates, and automation standards aligned with risk and compliance requirements.
- Build and lead a high-performing team; develop talent in REXX, z/OS automation, DevOps, and integration.
- Partner across platforms, applications, cyber, risk, and compliance to prioritize automation investments that reduce toil and operati...