About this role

Amazon is hiring a mid-level Software Engineer based in Cupertino, CA. The posting calls out experience with Python, CUDA, AWS, PyTorch. Compensation is listed at $165,200–$223,600 per year.

Role: Software Engineer
Function: software engineering
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Cupertino, CA
Department: Software Development
Posted: Dec 9, 2025

AI Summary

Mid-level software engineer optimizing deep learning inference on AWS custom ML accelerators (Inferentia/Trainium). Develops high-performance kernels, works across PyTorch to hardware layers, and enables LLM model families for optimal performance. Requires strong ML and systems optimization expertise.

Upgrade to Pro for AI summaries, resume match scores & career intelligence →

More roles at Amazon

Technical Infrastructure Program Manager, Fiber Deployment, Global Connectivity Infrastructure Development

London, United Kingdom · mid

AWS Networking Cloud Computing

Data Center IT Support Engineer, MEL - DCO

Melbourne, Australia · mid

AWS Linux

Engineering Operation Technician, Data Center Operations

Wharton, TX · mid

AWS Networking

Engineering Operation Technician, Data Center Operations

Wharton, TX · mid

AWS Networking

Strategic Procurement Specialist, Delivery Services

Sydney, Australia · mid

SQL MySQL All Amazon jobs →

Job description

from Amazon careers

The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from PyTorch till the hardware-software boundary, our engineers build systematic infrastructure, innovate new methods and create high-performance kernels for ML functions, ensuring every compute unit is fine tuned for optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across multiple technology layers - from frameworks and kernels and collaborate…

This is an excerpt. Read the full job description on Amazon careers →

All software engineering jobs software engineering in Cupertino, CA Jobs in Cupertino, CA software engineering salaries software engineering career path

All Amazon Jobs Browse software engineering roles mid positions

Software Development Engineer - AI/ML, AWS Neuron

About this role

More roles at Amazon

Job description