AI Infrastructure Engineer · mid · machine learning · IC · 3+ yrs · Bachelor's · Posted Apr 20, 2026

About this role

Nvidia is hiring a mid-level AI Infrastructure Engineer in the machine learning function based in Shanghai, China. The posting calls out experience with Deep Learning, Python, Bash, CUDA and roughly 3+ years of relevant work. Listed education preference: a bachelor's degree or equivalent.

Role: AI Infrastructure Engineer
Function: machine learning
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Shanghai, China
Experience: 3+ years
Education: Bachelor's degree
Posted: Apr 20, 2026
AI Summary
Deploy, manage, and maintain large-scale HPC/AI clusters using Linux job scheduling tools like Slurm and Kubernetes. Troubleshoot infrastructure from bare metal to application level. Requires 3+ years HPC/AI experience, Linux expertise, networking knowledge, Python/bash scripting, and familiarity with GPU technologies and orchestration platforms.

Job description

from Nvidia careers

NVIDIA is looking for an HPC and AI Cluster Engineer to join the Networking clusters solutions HPC/AI Infrastructure team. We are building supercomputers and AI clusters based on groundbreaking technologies. We are looking for a cluster engineer to be a key player working with the most exciting computing hardware and software and contributing to the latest breakthroughs in artificial intelligence and GPU computing.

You will work with the latest accelerated computing and deep learning software and hardware platforms, and with many scientific researchers, developers, and customers to craft improved workflows and develop new, differentiated solutions. You will interact with HPC, OS, GPU compute, and systems specialists to architect, develop, and bring up large-scale performance platforms. Does this sound like you? If so, we would love to hear from you!

What you will be doing:

  • Deploy, manage, and maintain large-scale HPC/AI clusters

  • Manage Linux job/workload schedulers and orchestration tools

  • Support and maintain continuous integration and delivery pipelines

  • Troubleshoot and fix issues from the bottom up: bare metal, operating system, software stack, and application level

  • Support research and development activities and engage in proofs of concept (POCs) for future improvements

What we need to see:

  • Bachelor's Degree in Computer Science, Engineering, or a related field; or equivalent experience

    This is an excerpt. Read the full job description on Nvidia careers →