mid Software Engineer ic · Posted Nov 19, 2025
$165,200 – $223,600
USD per year

About this role

Amazon is hiring a mid-level Software Engineer based in Cupertino, CA. The posting calls out experience with Python, CUDA, AWS, PyTorch. Compensation is listed at $165,200–$223,600 per year.

Role
Software Engineer
Function
software engineering
Level
mid
Track
Individual contributor
Employment
Full-time
Location
Cupertino, CA
Department
Software Development
Posted
Nov 19, 2025
AI Summary
Mid-level ML engineer optimizing deep learning inference on AWS custom accelerators (Inferentia/Trainium). Develops high-performance kernels, tunes LLM models like Llama and DeepSeek, and collaborates across compiler, runtime, and hardware layers. Requires strong ML expertise, distributed systems knowledge, and experience with PyTorch or JAX frameworks.

More roles at Amazon

Delivery Trainer, RSR
Traverse City, MI · mid
Agile Compliance
Delivery Trainer, RSR
Abbeville, LA · mid
Agile Compliance
Delivery Trainer, RSR
North Mankato, MN · mid
Agile Compliance
Operations Supervisor
Knuellwald, Germany · mid
Data Center Technician (Night Shift)
Mesa, AZ · mid
React AWS Networking
All Amazon jobs →

Job description

from Amazon careers

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from PyTorch till the hardware-software boundary, our engineers build systematic infrastructure, innovate new methods and create high-performance kernels for ML functions, ensuring every compute unit is fine tuned for optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across multiple technology layers - from frameworks and kernels and collaborate…

This is an excerpt. Read the full job description on Amazon careers →
All software engineering jobs software engineering in Cupertino, CA Jobs in Cupertino, CA software engineering salaries software engineering career path
All Amazon Jobs Browse software engineering roles mid positions