principal software engineering Hardware Engineer tech_leadership · Posted May 19, 2026

About this role

Nvidia is hiring a principal-level Hardware Engineer in the software engineering function based in Santa Clara, CA.

Role
Hardware Engineer
Function
software engineering
Level
principal
Track
Tech leadership
Employment
Full-time
Location
Santa Clara, CA
Posted
May 19, 2026

More roles at Nvidia

Senior System Software Engineer - AI Performance and Efficiency Tools
Shanghai, China · senior
Python CUDA Kubernetes
Software Developer - Networking
Yokneam, Israel · mid
C++ C Networking
Senior System Software Engineer - Linux Kernel Storage
Hyderabad, India · senior
Performance Optimization
Senior Software Engineer, NCCL and CUDA - CSP Engagements
Santa Clara, CA · senior
Kubernetes Docker Ansible
Director, AI Enablement
Santa Clara, CA · director
LLMs Machine Learning
All Nvidia jobs →

Job description

from Nvidia careers

NVIDIA is seeking a Principal Failure Analysis Engineer to lead Silicon Failure Analysis (SiFA) Lab Infrastructure, responsible for enabling a high-availability, safe, and scalable failure analysis environment. This role leads the lab framework including facilities, utilities, tool enablement, safety, access control, and operational readiness so that Fault Isolation (FI), Physical Failure Analysis (PFA), and Supplier Quality Engineering (SQE) teams can efficiently root cause our groundbreaking semiconductor products. The role partners closely with FI, PFA, SQE, Corporate Facilities, EHS, IT, Finance, Procurement, and equipment vendors to ensure reliable, secure, and scalable lab operations aligned with NVIDIA’s technology roadmap.

What You'll Be Doing:

  • Lead the overall Silicon Failure Analysis (SiFA) Lab infrastructure, ensuring a safe, highly available, and scalable environment that enables FI, PFA, and SQE teams to efficiently root‑cause advanced semiconductor issues

  • Own day‑to‑day lab operations and infrastructure readiness, serving as the primary point of accountability for availability, reliability, and rapid resolution of infrastructure issues impacting failure analysis operations

  • Manage lab facilities and utilities including power, backup power, cooling water, DI/PCW, exhaust, vacuum, CDA, nitrogen, and specialty gases, coordinating upgrades, maintenance, outages, and construction to minimize disruption

  • Drive failure analysis tool enablement and reliability from delivery through sustained operation, ensuring preventive maintenance and improving uptime, availability, MTBF, MTTR, and PM compliance

    This is an excerpt. Read the full job description on Nvidia careers →
All software engineering jobs software engineering in Santa Clara, CA Jobs in Santa Clara, CA software engineering salaries software engineering career path
All Nvidia Jobs Browse software engineering roles principal positions