senior software engineering Site Reliability Engineer ic · Posted May 29, 2026
$136,600 – $184,800
USD per year

About this role

Amazon is hiring a senior-level Site Reliability Engineer in the software engineering function based in Herndon, VA. The posting calls out experience with AWS, Networking, Machine Learning, Cloud Computing. Compensation is listed at $136,600–$184,800 per year.

Role
Site Reliability Engineer
Function
software engineering
Level
senior
Track
Individual contributor
Employment
Full-time
Location
Herndon, VA
Department
Systems, Quality, & Security Engineering
Posted
May 29, 2026
AI Summary
Senior Infrastructure Reliability Engineer driving reliability risk identification and mitigation for AWS datacenter power generation systems. Conducts root cause analysis of generator failures, applies Physics-of-Failure methodologies, and leads vendor qualification and fleet performance monitoring. Requires expertise in reliability engineering tools (FMEA, fault tree analysis, Weibull analysis) and statistical analysis of generator test and field data.

More roles at Amazon

Data Center Controls Tech, Deployment
Herndon, VA · mid
AWS Networking Cloud Computing
High-Speed Interface Validation Engineer, Post Silicon Validation
Austin, TX · mid
Python AWS Testing
Head of Environment, GPO Sustainability
Luxembourg · director
Sr. Delivery Consultant - AI/ML, AWS Professional Services
Arlington, VA · senior
Python SQL AWS
ML Accelerator Performance Validation Engineer, Post Silicon Validation
Austin, TX · senior
Python Java CUDA
All Amazon jobs →

Job description

from Amazon careers

As an Infrastructure Reliability Engineer specializing in Power Generation, you will be proactively driving the reliability risk identification, assessment, and mitigation for datacenter LV MV generator systems. You will be responsible for root cause analysis of critical generator failures and drive continuous improvements to enhance datacenter availability for AWS customers. You will work closely with both internal teams and external partners including generator OEMs, fuel system suppliers, and service providers to drive key aspects of product specification, risk identification, and execution. You must be ownership minded, independent, action and results oriented to succeed in an open collaborative environment. The candidate should have experience applying Physics-of-Failure (PoF) based approaches to develop and implement both analytical and empirical methods for generator quality and reliability risk identification across design, manufacture, and deployment stages. The candidate should be able to drive AWS application-specific requirements for lifecycle environmental and operational stress analysis of generator systems. The candidate should be capable of evaluating not only generator design quality and reliability risks, but also have the skills and experience in assessing manufacturing process related quality issues for generator components and assemblies. Knowledge of statistical techniques and models is required to analyze generator test data and field performance…

This is an excerpt. Read the full job description on Amazon careers →
All software engineering jobs software engineering in Herndon, VA Jobs in Herndon, VA software engineering salaries software engineering career path
All Amazon Jobs Browse software engineering roles senior positions