senior software engineering Site Reliability Engineer ic 5+ yrs Bachelor's · Posted Apr 20, 2026

About this role

Nvidia is hiring a senior-level Site Reliability Engineer in the software engineering function based in Santa Clara, CA. The posting calls out experience with Python, Ruby, AWS, GCP and roughly 5+ years of relevant work. Listed education preference: a bachelor's degree or equivalent.

Role
Site Reliability Engineer
Function
software engineering
Level
senior
Track
Individual contributor
Employment
Full-time
Location
Santa Clara, CA
Experience
5+ years
Education
Bachelor's degree
Posted
Apr 20, 2026
AI Summary
Senior SRE owns end-to-end infrastructure solutions for NVIDIA's global compute platform across multi-cloud environments. Requires 5+ years building critical services, HPC cluster experience (Slurm/LSF/Kubernetes), IaC proficiency, and strong infrastructure automation expertise to ensure high uptime and operational excellence.

More roles at Nvidia

Senior System Software Engineer - Linux Kernel Storage
Hyderabad, India · senior
Performance Optimization
Senior Software Engineer, NCCL and CUDA - CSP Engagements
Santa Clara, CA · senior
Kubernetes Docker Ansible
Director, AI Enablement
Santa Clara, CA · director
LLMs Machine Learning
Senior CI/CD Engineer
Santa Clara, CA · senior
AWS GCP Azure
Senior Board Test Engineer
Santa Clara, CA · senior
Python Bash Testing
All Nvidia jobs →

Job description

from Nvidia careers

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.

We’re looking for a Senior SRE to join our Compute Farm team and help build the next generation of our global services platform. At NVIDIA, you’ll keep critically important systems running while working on the technologies that are redefining computing. You’ll harness the power of AI to deliver groundbreaking solutions to some of the world’s toughest problems—and see your work have real, lasting impact!

What you'll be doing:

  • Own SRE solutions end‑to‑end, from design and implementation to operation and continuous improvement, ensuring they integrate cleanly with HPC schedulers, storage, and network fabrics.

    This is an excerpt. Read the full job description on Nvidia careers →
All software engineering jobs software engineering in Santa Clara, CA Jobs in Santa Clara, CA software engineering salaries software engineering career path
All Nvidia Jobs Browse software engineering roles senior positions