Principal · Software engineering · Principal Engineer · Tech leadership · Posted May 19, 2026

Skills

Python CUDA PyTorch LLMs Deep Learning Reinforcement Learning Data Structures Machine Learning Data Analytics vLLM

About this role

Nvidia is hiring a Principal Engineer (principal level) in its software engineering function, based in Santa Clara, CA. The posting calls out experience with Python, CUDA, PyTorch, and LLMs.

Role: Principal Engineer
Function: software engineering
Level: principal
Track: Tech leadership
Employment: Full-time
Location: Santa Clara, CA
Posted: May 19, 2026

Job description

from Nvidia careers

We are now looking for a Senior Performance Architect for Nemotron! At NVIDIA, we are redefining the future of AI systems through deep model–system–hardware co-design. We are looking for a forward-thinking Nemotron Performance Architect to shape the next generation of Nemotron models through performance modeling, analysis, and forward projections. In this role, you will predict before we build, developing high-fidelity models to evaluate how architectural choices translate into real-world deployment efficiency. You will ensure that future models achieve Pareto-optimal trade-offs across accuracy, throughput, and interactivity on target platforms.

Recent efforts such as LatentMoE architectures and the Nemotron Super model exemplify the kind of performance-driven co-design you will help advance—where modeling insights directly shape model architecture and system efficiency at scale. This role sits at the center of Generative AI evolution, partnering across research, framework development, compiler, and hardware teams to guide decisions that determine how efficiently intelligence scales in production.

What You’ll Be Doing:

Develop high-fidelity analytical performance models to prototype emerging algorithmic techniques and hardware optimizations, driving model-hardware co-design for the Nemotron family of models.
Prioritize features to guide the future software and hardware roadmap based on detailed performance modeling and analysis.
Model the end-to-end performance impact of emerging GenAI workflows, such as speculative decoding, agentic pipelines, inference-time compute scaling, and RL, to understand future datacenter needs.
This is an excerpt. Read the full job description on Nvidia careers.
