💼 Full-Time Position

Senior Performance Architect, Nemotron

🏢 NVIDIA
📍 Santa Clara, CA, United States
📅 Posted: June 05, 2026


Job Description

We are now looking for a forward-thinking Senior Performance Architect for Nemotron! At NVIDIA, we are redefining the future of AI systems through deep model–system–hardware co-design. In this role, you will shape the next generation of Nemotron models through performance modeling, analysis, and forward projections. You will predict before we build, developing high-fidelity models to evaluate how architectural choices translate into real-world deployment efficiency. You will ensure that future models achieve Pareto-optimal trade-offs across accuracy, throughput, and interactivity on target platforms.


Recent efforts such as LatentMoE architectures (https://research.nvidia.com/labs/nemotron/LatentMoE/) and the Nemotron Super model (https://developer.nvidia.com/blog/introducing-nemotron-3-super-an-open-hybrid-mamba-transformer-moe-for-agentic-reasoning/) exemplify the kind of performance-driven co-design you will help advance...