Job Description
Responsibilities
- Own the day‑to‑day operational health of the platform across Nutanix on‑prem stacks and AWS recovery environments.
- Monitor availability, performance, and capacity across critical workloads and respond to platform‑level incidents.
- Operate Rubrik Enterprise for backup and recovery: policies, schedules, retention, replication to AWS, and storage capacity.
- Ensure all priority (P1/P2) workloads are correctly protected and regularly validated through restore tests.
- Plan and execute disaster recovery and cyber‑recovery exercises, and keep recovery runbooks and documentation up to date.
- Execute and support workload migrations from VMware and physical infrastructure into Nutanix, including pre‑ and post‑migration validation.
- Use Windows Server and Linux expertise to troubleshoot platform and workload availability and performance issues (this excludes routine BAU patching).
- Automate routine reliability tasks (backu...