Machine Learning - Compiler Engineer, AWS Neuron, Annapurna Labs
Amazon · Cupertino, CA · Software Development
About this role
Amazon is hiring a mid-level Machine Learning Engineer based in Cupertino, CA. The posting calls out experience with Java, AWS, TensorFlow, and PyTorch. Listed education preference: a bachelor's degree or equivalent. Compensation is listed at $165,200–$223,600 per year.
- Role: Machine Learning Engineer
- Function: machine learning
- Level: mid
- Track: Individual contributor
- Employment: Full-time
- Location: Cupertino, CA
- Education: Bachelor's degree
- Department: Software Development
- Posted: Oct 31, 2025
Job description
From Amazon careers
Do you want to be part of the AI revolution? At AWS, our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. To deliver on that vision, we’ve created innovative software and hardware solutions. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads.
This role is for a software engineer on the Compiler team for AWS Neuron. In this role, you will be responsible for building the next-generation Neuron compiler, which transforms ML models written in ML frameworks (e.g., PyTorch, TensorFlow, and JAX) so they can be deployed on AWS Inferentia- and Trainium-based servers in the Amazon cloud. You will solve hard compiler optimization problems to achieve optimum performance for a variety of ML model families, including massive-scale large language models like Llama, DeepSeek, and beyond, as well as Stable Diffusion, vision transformers, and multi-modal models. You will be required to understand how these models work inside out to make informed decisions on how best to coax the compiler to generate optimal implementation instruction.…