Manager, Site Reliability Engineering
Navan · Tel Aviv, Israel · Engineering
About this role
Navan is hiring a manager-level Engineering Manager in the software engineering function based in Tel Aviv, Israel. The posting calls out experience with AWS, Terraform, CloudFormation, Serverless.
- Role
- Engineering Manager
- Function
- software engineering
- Level
- manager
- Track
- hybrid
- Employment
- Full-time
- Location
- Tel Aviv, Israel
- Department
- Engineering
More roles at Navan
Job description
from Navan careersAt Navan, we’re committed to creating the best experience for business travelers, ensuring that our systems are always reliable, scalable, and efficient. As we continue to grow, we’re looking for a Site Reliability Engineering (SRE) Manager to join our team in headquarters based out of Tel-Aviv. In this role, you will lead a team of SREs, drive innovation in infrastructure design and automation, and ensure our systems run seamlessly at scale, serving thousands of travelers every day.
What You’ll Do
- Lead & Mentor the SRE Team: Guide and develop a high-performing team of SREs, fostering a culture of collaboration, reliability, and continuous improvement.
- Drive Infrastructure Reliability & Automation: Collaborate with Engineering and Product teams to design and implement scalable, fault-tolerant systems. Leverage IaC tools (e.g., Terraform, CloudFormation) and microservices architectures to automate and improve infrastructure.
- Incident Management: Improve incident response processes, reduce MTTR, and proactively mitigate risks. Apply resiliency patterns to ensure systems are fault-tolerant and highly available.
- Define & Measure SLOs: Develop service-level objectives (SLOs) and KPIs to track and improve system reliability, using tools like NewRelic or DataDog for observability.
- 24x7 Production Support: Ensure system availability in a 24x7 environment, applying expertise in AWS (e.g., ECS, Lambda, DynamoDB) and database management for optimal performance.