Job Description
SRE Engineer - Hyderabad, India
Department: Aisa Technology, Japan
Role Overview
MetLife is seeking an experienced Site Reliability Engineer (SRE) to ensure the availability, scalability, and performance of critical systems and services. The role involves monitoring, automation, incident management, and collaboration with engineering teams to optimize system reliability and efficiency.
Key Responsibilities
- System Reliability & Performance: Ensure system uptime, troubleshoot issues, and optimize performance.
- Service Design & Automation: Develop automation scripts and tools to streamline operations.
- Monitoring & Alerting: Implement observability solutions using ELK, Grafana, Splunk, and Azure Monitor.
- Incident Response & Management: Lead root cause analysis, post-mortems, and corrective actions.
- Collaboration: Work with engineering teams to align system performance with business goals. ...