💼 Full-Time Position

Distinguished Engineer, GPU Fleet Operations Automation

🏢
NVIDIA
📍 Santa Clara, CA, United States
📍
Location
Santa Clara, United States
📅
Posted
June 01, 2026
Type
Full-Time
🎯

Full-Time Opportunity: This is a permanent, full-time position with a competitive package and real career growth potential.

Job Description

NVIDIA is leading the industry in delivering accelerated computing in cloud and enterprise environments. We’re a team of innovative engineers dedicated to solving some of the world’s biggest challenges, constantly driving advancements, and impacting millions of lives worldwide!


As a technology leader at NVIDIA, you will lead the development of DGX Cloud strategy for GPU fleet lifecycle, health, observability and utilization monitoring, and remediation. You will define and drive the technical strategy across multiple environments (bare metal, cloud service provider, and neoclouds). Including defining and developing the auto-remediation strategies to detect, fix, validate, and restore-to-service critical systems. You will work with NVIDIA leadership cross-organizationally and cross-functionally to deliver accelerated computing infrastructure that enables customers with the highest availability and operational standards.


What You’ll Be Doing:
+ Various ...