Machine Learning Systems Engineer, Networking
Nvidia · Santa Clara, CA
About this role
Nvidia is hiring a mid-level Machine Learning Engineer based in Santa Clara, CA. The posting calls out experience with Python, Rust, C, Scala.
- Role
- Machine Learning Engineer
- Function
- machine learning
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Santa Clara, CA
- Posted
- May 18, 2026
More roles at Nvidia
Job description
from Nvidia careersJoin our team of innovative engineers who are building an AI Data Center AIOps platform that turns raw, high-volume telemetry into reliable, job-centric insights and automation for GPU fleets. As an ML Engineer on this team, you'll design and implement ML algorithms that run in real-time streaming pipelines, detecting anomalies and surfacing insights across massive-scale infrastructure before they impact AI training and inference.
The core challenge of this role is building ML algorithms that are simultaneously accurate and efficient —processing millions of telemetry streams in real time within tight CPU and memory budgets. You'll need both the data science depth to design and validate algorithms and the engineering discipline to implement them in production at scale.
What you'll be doing:
Implement production ML algorithms in Go — optimized for real-time streaming pipelines operating at massive scale under strict resource constraints
Design and develop new ML algorithms where needed: anomaly detection, health scoring, and predictive analytics on high-volume time-series telemetry from GPU and network infrastructure
Improve and extend existing algorithms and experiment with new approaches suited to real-time streaming constraints
Build and maintain end-to-end ML pipelines — from data ingestion and schema design through model inference — optimized for on-premises, latency-sensitive deployments
This is an excerpt. Read the full job description on Nvidia careers →