principal machine learning ML Platform Engineer ic · Posted Apr 16, 2026
$208,300 – $281,800
USD per year

About this role

Amazon is hiring a principal-level ML Platform Engineer in the machine learning function based in Cupertino, CA. The posting calls out experience with AWS, PyTorch, LLMs, Deep Learning. Compensation is listed at $208,300–$281,800 per year.

Role
ML Platform Engineer
Function
machine learning
Level
principal
Track
Individual contributor
Employment
Full-time
Location
Cupertino, CA
Department
Project/Program/Product Management--Technical
Posted
Apr 16, 2026

More roles at Amazon

Infrastructure Reliability Engineer, Infrastructure Reliability & Quality
Herndon, VA · mid
AWS Networking Machine Learning
Software Development Engineer, Amazon Ads Nova
New York City, NY · mid
Java C# AI Agents
Certified Pharmacy Technician, Fulfillment - Amazon Pharmacy
Miami, FL · mid
Delivery Consultant – SAP , Professional Services, SAP
New York City, NY · mid
Python Ruby AWS
MLA IP Design Verification Engineer, Annapurna Labs
Cupertino, CA · mid
Python AWS Machine Learning
All Amazon jobs →

Job description

from Amazon careers

AWS Trainium is deployed at scale, with millions of chips in production, used for training and inference of frontier models. AWS Neuron is the software stack for Trainium, enabling customers to run deep learning and generative AI workloads with optimal performance and cost efficiency. AWS Neuron is hiring a Principal Technical Product Manager to define and drive product strategy for training software on Trainium. This includes distributed training libraries, post-training workflows (RLHF, DPO, fine-tuning), reinforcement learning frameworks, and training performance optimization. Your mission is to enable researchers and operators to train frontier models at scale on Trainium, from single-node experimentation to distributed training across thousands of nodes. You will be the champion inside AWS for frontier model builders pushing the bounds of scale and resilience for current and emerging training paradigms. You will work with customers inside and outside the company to identify key improvements and stay ahead of the training landscape. You will define how Neuron supports the training AI/ML ecosystem and what tools customers will use for their training workflows on Trainium. To be successful, you will partner with engineering teams building training libraries and distributed training infrastructure, applied scientists developing optimization techniques, and PMs responsible for compiler,…

This is an excerpt. Read the full job description on Amazon careers →
All machine learning jobs machine learning in Cupertino, CA Jobs in Cupertino, CA machine learning salaries machine learning career path
All Amazon Jobs Browse machine learning roles principal positions