mid Software Engineer ic · Posted Dec 9, 2025
$165,200 – $223,600
USD per year

About this role

Amazon is hiring a mid-level Software Engineer based in Cupertino, CA. The posting calls out experience with Python, CUDA, AWS, PyTorch. Compensation is listed at $165,200–$223,600 per year.

Role
Software Engineer
Function
software engineering
Level
mid
Track
Individual contributor
Employment
Full-time
Location
Cupertino, CA
Department
Software Development
Posted
Dec 9, 2025
AI Summary
Mid-level software engineer optimizing deep learning inference on AWS custom ML accelerators (Inferentia/Trainium). Develops high-performance kernels, works across PyTorch to hardware layers, and enables LLM model families for optimal performance. Requires strong ML and systems optimization expertise.

More roles at Amazon

Technical Infrastructure Program Manager, Fiber Deployment, Global Connectivity Infrastructure Development
London, United Kingdom · mid
AWS Networking Cloud Computing
Data Center IT Support Engineer, MEL - DCO
Melbourne, Australia · mid
AWS Linux
Engineering Operation Technician, Data Center Operations
Wharton, TX · mid
AWS Networking
Engineering Operation Technician, Data Center Operations
Wharton, TX · mid
AWS Networking
Strategic Procurement Specialist, Delivery Services
Sydney, Australia · mid
SQL MySQL
All Amazon jobs →

Job description

from Amazon careers

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from PyTorch till the hardware-software boundary, our engineers build systematic infrastructure, innovate new methods and create high-performance kernels for ML functions, ensuring every compute unit is fine tuned for optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across multiple technology layers - from frameworks and kernels and collaborate…

This is an excerpt. Read the full job description on Amazon careers →
All software engineering jobs software engineering in Cupertino, CA Jobs in Cupertino, CA software engineering salaries software engineering career path
All Amazon Jobs Browse software engineering roles mid positions