About this role

Nvidia is hiring a mid-level AI Infrastructure Engineer in the machine learning function based in Santa Clara, CA. The posting calls out experience with Python, CUDA, Deep Learning, API Development.

Role: AI Infrastructure Engineer
Function: machine learning
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Santa Clara, CA
Posted: May 28, 2026

More roles at Nvidia

Firmware Engineer

Yokneam, Israel · mid

Python C Networking

Senior Power Integrity Methodology CAD Engineer

Santa Clara, CA · senior

Python

Senior System Software Engineer - Metropolis

Pune, India · senior

AWS GCP Azure

Senior Physical Design Methodology Engineer, Innovus Flows

Santa Clara, CA · senior

Python PyTorch scikit-learn

Senior System Software Engineer

Santa Clara, CA · senior

Python C Embedded Systems All Nvidia jobs →

Job description

from Nvidia careers

We are now looking for a Senior AI Frameworks Engineer (C++/Python)! NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance math primitives. Since 2017, it has provided the community with C++ template abstractions to implement custom GEMM and related computations efficiently on NVIDIA GPUs.

We are building the next frontier of this ecosystem: Pythonic CUTLASS (CUTLASS DSL). This initiative aims to bring "speed-of-light" performance and powerful abstractions of our stack directly into the Python environment. Join the CUTLASS team and help bridge the gap between low-level hardware primitives and high-level developer productivity. If you are passionate about building elegant, high-performance DSLs and want to empower the next generation of AI researchers and engineers with better tools, apply today!

What you'll be doing:

As a core contributor to the CUTLASS project, you will use your expertise in systems programming and API design to create a world-class developer experience for GPU programming and kernel delivery.

Design APIs that prioritize user productivity, providing a "native" feel for developers accustomed to modern scientific computing and deep learning frameworks.
Develop robust compilation infrastructure—including AST transformations and JIT-friendly execution—to lower Pythonic descriptions into high-performance GPU machine code.
This is an excerpt. Read the full job description on Nvidia careers →

All machine learning jobs machine learning in Santa Clara, CA Jobs in Santa Clara, CA machine learning salaries machine learning career path

All Nvidia Jobs Browse machine learning roles mid positions