mid Machine Learning Engineer ic Bachelor's · Posted Oct 31, 2025
$165,200 – $223,600
USD per year

About this role

Amazon is hiring a mid-level Machine Learning Engineer based in Cupertino, CA. The posting calls out experience with Java, AWS, TensorFlow, PyTorch. Listed education preference: a bachelor's degree or equivalent. Compensation is listed at $165,200–$223,600 per year.

Role
Machine Learning Engineer
Function
machine learning
Level
mid
Track
Individual contributor
Employment
Full-time
Location
Cupertino, CA
Education
Bachelor's degree
Department
Software Development
Posted
Oct 31, 2025
AI Summary
Build next-generation ML compiler for AWS Neuron that transforms models from PyTorch, TensorFlow, and JAX for deployment on custom chips. Solve compiler optimization problems for large language models and vision transformers. Requires C++/Java expertise; compiler or ML accelerator experience preferred.

More roles at Amazon

Delivery Consultant – SAP , Professional Services, SAP
New York City, NY · mid
Python Ruby AWS
MLA IP Design Verification Engineer, Annapurna Labs
Cupertino, CA · mid
Python AWS Machine Learning
Sr. Social Media & Influencer Manager, Amazon Books Brand & Social
Seattle, WA · senior
Senior Supply Chain Manager, Fauna
New York City, NY · senior
Full Stack
Ingénieur en machine learning/Machine Learning Engineer, ProServe Shared Delivery Team - Data & AI
Montreal, Canada · mid
Python JavaScript TypeScript
All Amazon jobs →

Job description

from Amazon careers

Do you want to be part of AI revolution? At AWS our vision is to make deep learning pervasive for everyday developers and to democratize access to AI hardware and software infrastructure. In order to deliver on that vision, we’ve created innovative software and hardware solutions that make it possible. AWS Neuron is the SDK that optimizes the performance of complex ML models executed on AWS Inferentia and Trainium, our custom chips designed to accelerate deep-learning workloads. This role is for a software engineer in the Compiler team for AWS Neuron. As part of this role, you will be responsible for building next generation Neuron compiler which transforms ML models written in ML frameworks (e.g, PyTorch, TensorFlow, and JAX) to be deployed AWS Inferentia and Trainium based servers in the Amazon cloud. You will be responsible for solving hard compiler optimization problems to achieve optimum performance for variety of ML model families including massive scale large language models like Llama, Deepseek, and beyond as well as stable diffusion, vision transformers and multi-model models. You will be required to understand how these models work inside-out to make informed decisions on how to best coax the compiler to generate optimal implementation instruction.…

This is an excerpt. Read the full job description on Amazon careers →
All machine learning jobs machine learning in Cupertino, CA Jobs in Cupertino, CA machine learning salaries machine learning career path
All Amazon Jobs Browse machine learning roles mid positions