Staff Site Reliability Engineer - Incident Management & Reliability (Remote - Canada)
Confluent · Remote (Canada) · Engineering
About this role
Confluent is hiring a staff-level Site Reliability Engineer for a remote position in its software engineering function. The posting calls out experience with AWS, GCP, Azure, and Kubernetes, along with 10+ years of relevant work. Compensation is listed at C$225,100–C$264,500 per year.
- Role: Site Reliability Engineer
- Function: Software engineering
- Level: Staff
- Track: Tech leadership
- Employment: Full-time
- Location: Remote (Canada)
- Work mode: Remote
- Experience: 10+ years
- Department: Engineering
- Posted: Jan 23, 2026
Job description
from Confluent careers
We’re not just building better tech. We’re rewriting how data moves and what the world can do with it. With Confluent, data doesn’t sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them.
It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together.
One Confluent. One Team. One Data Streaming Platform.
About the Role:
Confluent Cloud processes millions of events per second across AWS, GCP, and Azure. When incidents happen in a multi-cloud streaming platform, they happen at scale—data in motion, exactly-once semantics, and cascading failure modes that require deep systems thinking. We need an expert-level engineer who can drive proactive reliability improvements that prevent these incidents before they occur.
This role combines hands-on technical work with strategic program ownership. You'll spend roughly 75% of your time on engineering: building automation, improving tooling, analyzing systemic failure patterns, and designing reliability improvements. The remaining 25% is teaching and coordination: coaching teams through post-mortems, training incident commanders, and evolving our incident response practices.