About this role

Amazon is hiring a mid-level Software Engineer based in Cupertino, CA. The posting calls for experience with Python, CUDA, AWS, and PyTorch. Compensation is listed at $165,200–$223,600 per year.

Role: Software Engineer
Function: software engineering
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Cupertino, CA
Department: Software Development
Posted: Nov 19, 2025

AI Summary

Mid-level ML engineer optimizing deep learning inference on AWS custom accelerators (Inferentia/Trainium). Develops high-performance kernels, tunes LLMs such as Llama and DeepSeek, and collaborates across compiler, runtime, and hardware layers. Requires strong ML expertise, distributed systems knowledge, and experience with PyTorch or JAX.

Job description

from Amazon careers

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The SDK includes an ML compiler, runtime, and application framework that integrate with popular ML frameworks such as PyTorch and JAX to deliver high-performance ML inference and training. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models, supporting novel architectures, and maximizing their performance on AWS's custom ML accelerators. Working across the stack, from PyTorch down to the hardware-software boundary, our engineers build systematic infrastructure, develop new methods, and create high-performance kernels for ML functions, ensuring every compute unit is fine-tuned for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across multiple technology layers - from frameworks and kernels and collaborate…

This is an excerpt. Read the full job description on Amazon careers →

Software Development Engineer, AI/ML, AWS Neuron, Model Inference
