Principal PMT-ES - AI/ML Training, Annapurna Labs
Amazon · Cupertino, CA · Project/Program/Product Management--Technical
About this role
Amazon is hiring a principal-level ML Platform Engineer in the machine learning function based in Cupertino, CA. The posting calls out experience with AWS, PyTorch, LLMs, Deep Learning. Compensation is listed at $208,300–$281,800 per year.
- Role
- ML Platform Engineer
- Function
- machine learning
- Level
- principal
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Cupertino, CA
- Department
- Project/Program/Product Management--Technical
- Posted
- Apr 16, 2026
More roles at Amazon
Job description
from Amazon careersAWS Trainium is deployed at scale, with millions of chips in production, used for training and inference of frontier models. AWS Neuron is the software stack for Trainium, enabling customers to run deep learning and generative AI workloads with optimal performance and cost efficiency. AWS Neuron is hiring a Principal Technical Product Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning frameworks, and training performance optimization. Your mission is to enable researchers and operators to train frontier models at scale on Trainium, from single-node experimentation to distributed training across thousands of nodes. You will be the champion inside AWS for frontier model builders pushing the bounds of scale and resilience for current and emerging training paradigms. You will work with customers inside and outside the company to identify key improvements and stay ahead of the training landscape. You will define how Neuron supports the training AI/ML ecosystem and what tools customers will use for their training workflows on Trainium. To be successful, you will partner with engineering teams building training libraries and distributed training infrastructure, applied scientists developing optimization techniques, and PMs responsible for compiler,…