Senior Developer Technology Engineer
Nvidia · Beijing, China
About this role
Nvidia is hiring a senior-level Platform Engineer in the software engineering function based in Beijing, China. The posting calls out experience with Python, CUDA, TensorFlow, PyTorch and roughly 3+ years of relevant work. Listed education preference: a bachelor's degree or equivalent.
- Role
- Platform Engineer
- Function
- software engineering
- Level
- senior
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Beijing, China
- Experience
- 3+ years
- Education
- Bachelor's degree
- Posted
- Apr 20, 2026
More roles at Nvidia
Job description
from Nvidia careersWe are building the next generation of GPU‑accelerated recommendation tools, redefining how models are trained and deployed at scale. Our mission is to make developing and productizing GPU‑based recommender systems as seamless, efficient, and powerful as possible. As part of this effort, you will join a world‑class team of ML, HPC, and Software Engineers focused on maximizing training and inference speed while enabling effortless scalability.
What You’ll Be Doing:
Profile, analyze, and optimize GPU‑accelerated code to improve training and inference performance for large‑scale recommender systems.
Design, implement, and maintain high‑performance C++/CUDA components within our core recommendation framework.
Develop and execute tests (unit, integration, and performance) to ensure numerical correctness, stability, and regression prevention in GPU workloads.
Collaborate closely with CUDA and ML engineers to interpret profiling results, refine designs, and implement optimization strategies.
Design and optimize high‑throughput data flows between GPUs, RDMA‑capable NICs, and NVMe SSDs using technologies such as GPUDirect RDMA and GPUDirect Storage.
What We Need to See:
Bachelor’s or Master’s degree in Computer Science, Software Engineering, Mathematics, or a related technical field.
3+ years of experience in C++, CUDA, and Python development on Linux systems.
Solid understanding of numerical computing, floating‑point behavior, and GPU performance profiling.
Proven ability to diagnose and optimize computational pipelines using profiling tools such as Nsight Systems or nvprof.
This is an excerpt. Read the full job description on Nvidia careers →