Deep Learning Performance Architect
Nvidia · Shanghai, China
About this role
Nvidia is hiring a mid-level Machine Learning Engineer based in Shanghai, China. The posting calls out experience with TensorFlow, PyTorch, LLMs, Deep Learning.
- Role
- Machine Learning Engineer
- Function
- machine learning
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Shanghai, China
- Posted
- May 13, 2026
More roles at Nvidia
Job description
from Nvidia careersNVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
NVIDIA is developing processor and system architectures that accelerate AI workloads based on neural networks. We are looking for an experienced deep learning performance architect to join our inference architecture team. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on brand-new hardware architectures for various DL workloads. You will make your contributions to our dynamic technology-focused company.
What you will be doing:
Analyze brand-new DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next-gen inference products.
This is an excerpt. Read the full job description on Nvidia careers →