Job Description
Chief HPC & AI Network Architect
EPAM Systems, Argentina
EPAM Systems is seeking a Chief HPC Network Engineer to define the global technical strategy, reference architecture, and engineering vision behind advanced AI, research, and Kubernetes-based GPU infrastructure for a major global technology client.
Required expertise: InfiniBand/RDMA, Kubernetes, and extensive experience in high-performance networking; strong skills in AI workloads and network observability.
Responsibilities
- Establish architectural roadmaps for GPU infrastructure and high-performance networking.
- Enforce engineering standards across teams.
- Provide technical leadership across teams.
- Architect, operate, and optimize high-performance network fabrics for large-scale LLM and distributed AI workloads.
- Own and drive the architectural vision for advanced AI and Kubernetes-based GPU infrastructure.
- Support advanced AI and GPU inf...