mid machine learning AI Infrastructure Engineer ic · Posted May 28, 2026

About this role

Nvidia is hiring a mid-level AI Infrastructure Engineer in the machine learning function based in Santa Clara, CA. The posting calls out experience with Python, CUDA, Deep Learning, API Development.

Role
AI Infrastructure Engineer
Function
machine learning
Level
mid
Track
Individual contributor
Employment
Full-time
Location
Santa Clara, CA
Posted
May 28, 2026

More roles at Nvidia

Firmware Engineer
Yokneam, Israel · mid
Python C Networking
Senior Power Integrity Methodology CAD Engineer
Santa Clara, CA · senior
Python
Senior System Software Engineer - Metropolis
Pune, India · senior
AWS GCP Azure
Senior Physical Design Methodology Engineer, Innovus Flows
Santa Clara, CA · senior
Python PyTorch scikit-learn
Senior System Software Engineer
Santa Clara, CA · senior
Python C Embedded Systems
All Nvidia jobs →

Job description

from Nvidia careers

We are now looking for a Senior AI Frameworks Engineer (C++/Python)! NVIDIA's high-performance computing platforms are powering the AI revolution across many applications and industries. Within our software stack, CUTLASS stands out as a popular open-source ecosystem dedicated to high-performance math primitives. Since 2017, it has provided the community with C++ template abstractions to implement custom GEMM and related computations efficiently on NVIDIA GPUs.

We are building the next frontier of this ecosystem: Pythonic CUTLASS (CUTLASS DSL). This initiative aims to bring "speed-of-light" performance and powerful abstractions of our stack directly into the Python environment. Join the CUTLASS team and help bridge the gap between low-level hardware primitives and high-level developer productivity. If you are passionate about building elegant, high-performance DSLs and want to empower the next generation of AI researchers and engineers with better tools, apply today!

What you'll be doing:

As a core contributor to the CUTLASS project, you will use your expertise in systems programming and API design to create a world-class developer experience for GPU programming and kernel delivery.

  • Design APIs that prioritize user productivity, providing a "native" feel for developers accustomed to modern scientific computing and deep learning frameworks.

  • Develop robust compilation infrastructure—including AST transformations and JIT-friendly execution—to lower Pythonic descriptions into high-performance GPU machine code.

    This is an excerpt. Read the full job description on Nvidia careers →
All machine learning jobs machine learning in Santa Clara, CA Jobs in Santa Clara, CA machine learning salaries machine learning career path
All Nvidia Jobs Browse machine learning roles mid positions