System Software Architect, AI and GPU Networking
Nvidia · Beijing, China
About this role
Nvidia is hiring a mid-level Solutions Architect in the software engineering function based in Beijing, China. The posting calls out experience with Python, CUDA, Deep Learning, Networking and roughly 5+ years of relevant work. Listed education preference: a master's degree or equivalent.
- Role
- Solutions Architect
- Function
- software engineering
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Beijing, China
- Experience
- 5+ years
- Education
- Master's degree
- Posted
- Apr 20, 2026
More roles at Nvidia
Job description
from Nvidia careersNVIDIA has been defining computer graphics, PC gaming, and accelerated computing for more than 25 years. With an outstanding legacy of innovation, driven by phenomenal technology, and extraordinary people, NVIDIA is looking for a strong technical senior architect to join us in shaping the future. Senior Architects are innovators who can translate business needs into workable technology solutions. Their expertise is deep and broad. They are hands on, producing both detailed technical work and high-level architectural designs.
As an architect in the AI Networking Research team, you will explore technological challenges on accelerate networking and building AI data centers. Develop and research new transport functions and semantics for optimizing AI workloads, AI systems communication and accelerations and much more. You will also be part of architectural and development efforts across numerous technological fields, related to the modern AI data center, such as distributed AI and deep learning solutions, data analytics, High Performance Computing (HPC), Software Defined Networking (SDN), virtualization, storage, and more.
What you’ll be doing:
Enhance NVIDIA's GPU Networking offerings for accelerating AI workloads, such as NVIDIA Dynamo, NVIDIA NIXL and NVIDIA UCX, tailored to the unique requirements of AI workloads.
Design and prototype features and optimizations that accelerate data movement and enable new capabilities for inference and model serving - focusing on throughput, latency, and memory efficiency..
This is an excerpt. Read the full job description on Nvidia careers →