Sr. Technical Program Manager, HWEng Accelerator Systems
Amazon · Cupertino, CA · Project/Program/Product Management--Technical
About this role
Amazon is hiring a senior-level Technical Program Manager in the software engineering function based in Cupertino, CA. The posting calls out experience with AWS, Cloud Computing and roughly 5+ years of relevant work.
- Role
- Technical Program Manager
- Function
- software engineering
- Level
- senior
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Cupertino, CA
- Experience
- 5+ years
- Department
- Project/Program/Product Management--Technical
- Posted
- Jan 5, 2026
More roles at Amazon
Job description
from Amazon careersAWS Hardware Engineering seeks a Senior Technical Program Manager to own end-to-end NPI delivery for storage server systems and/or accelerators while ensuring optimal fleet health and server availability for AWS customers. In this critical role, you will monitor fleet performance, proactively identify and mitigate risks before they impact customers, and coordinate cross-functional initiatives to improve compute capacity. You will design technical deployment methodologies with robust safety measures, drive continuous improvement in availability metrics, and communicate strategic recommendations to senior leadership. This position directly impacts customer satisfaction by delivering innovative hardware solutions and maintaining the reliability standards that keep the cloud running. Key job responsibilities - Own end-to-end delivery of NPIs for storage server systems or accelerators from concept to production deployment - Drive ODM/vendors to deliver complex technology on a timely basis - Track and report on overall fleet health across server systems, identifying trends and potential risks - Develop and implement solutions for short, medium, and long-term fleet health improvements - Coordinate across various teams to drive progress toward availability and reliability goals - Lead cross-functional initiatives to prevent customer-impacting incidents through proactive fleet management - Drive continuous improvement in server availability metrics and customer experience - Manage complex…