About this role

Amazon is hiring a manager-level Engineering Manager in the software engineering function based in Cupertino, CA. The posting calls out experience with AWS, PyTorch, LLMs, Machine Learning. Compensation is listed at $212,700–$287,700 per year.

Role: Engineering Manager
Function: software engineering
Level: manager
Track: hybrid
Employment: Full-time
Location: Cupertino, CA
Department: Software Development
Posted: Sep 5, 2025

AI Summary

Lead a team of AI/ML engineers to onboard and optimize large language models for inference on AWS Neuron and Trainium accelerators. Drive model enablement speed, inference optimization, and performance improvements across distributed inference systems. Requires strong LLM architecture knowledge, model optimization expertise, and ability to manage fast-changing priorities in a vertically integrated stack.

Upgrade to Pro for AI summaries, resume match scores & career intelligence →

More roles at Amazon

Senior Data Associate with German, Artificial General Intelligence

London, United Kingdom · junior

LLMs Machine Learning

Senior Business Intelligence Engineer, EU Stores CX Analytics & Automation

Clichy, France · senior

Python SQL React

Delivery Trainer, RSR

Traverse City, MI · mid

Agile Compliance

Delivery Trainer, RSR

Abbeville, LA · mid

Agile Compliance

Delivery Trainer, RSR

North Mankato, MN · mid

Agile Compliance All Amazon jobs →

Job description

from Amazon careers

DESCRIPTION AWS Utility Computing (UC) provides product innovations, from foundational services such as Amazon Elastic Compute Cloud (EC2), to new product innovations that continue to set AWS’s services and features apart in the industry. We develop AWS Neuron, the complete software stack for Trainium, Amazon's custom cloud-scale machine learning accelerators. Come optimize LLMs such as Llama and GPT-OSS to run really fast on Trainium. As the SDM for the LLM Inference Model Enablement team, you will lead a team of expert AI/ML engineers to onboard and optimize state-of-the-art open-source and customer LLMs, both dense and MoE, for inference on Neuron and Trainium and Inferentia accelerators. You will also drive improvements in model enablement speed and experience, while advancing inference usability and quality through inference features, infrastructure optimization, tools, and automation. The ideal candidate will have a strong background in LLM model architectures, model performance optimizations, and inference techniques, such as delivering high-performance models using distributed inference libraries. You should be capable of managing demanding, fast-changing priorities. You should have a strong technical ability to understand and deliver as part of a vertically integrated system stack consisting of the PyTorch inference library, Neuron compiler, runtime, and collectives. A day in the…

This is an excerpt. Read the full job description on Amazon careers →

All software engineering jobs software engineering in Cupertino, CA Jobs in Cupertino, CA software engineering salaries software engineering career path

All Amazon Jobs Browse software engineering roles manager positions

Software Development Manager, LLM Inference Model Enablement, Neuron SDK

About this role

More roles at Amazon

Job description