staff software engineering Site Reliability Engineer tech_leadership 10+ yrs · Posted Jan 23, 2026

C$225,100 – C$264,500

CAD per year

Skills

AWS GCP Azure Kubernetes Kafka CI/CD Jira PagerDuty Distributed Systems DevOps Observability Incident Response Confluence React Cloud Computing

About this role

Confluent is hiring a staff-level Site Reliability Engineer in the software engineering function as a remote position. The posting calls out experience with AWS, GCP, Azure, Kubernetes and roughly 10+ years of relevant work. Compensation is listed at C$225,100–C$264,500 per year.

Role: Site Reliability Engineer
Function: software engineering
Level: staff
Track: Tech leadership
Employment: Full-time
Location: Remote (Canada)
Work mode: Remote
Experience: 10+ years
Department: Engineering
Posted: Jan 23, 2026

AI Summary

Staff-level SRE driving proactive reliability improvements for Confluent's multi-cloud streaming platform. Spend 75% on engineering (automation, tooling, failure analysis) and 25% on incident response training and coordination. Requires 10+ years SRE/reliability experience, deep distributed systems expertise, and proficiency with incident management tools like Rootly and PagerDuty.

Upgrade to Pro for AI summaries, resume match scores & career intelligence →

More roles at Confluent

Staff Software Engineer I - Stream Governance

Remote (Canada) · staff

React AWS GCP

Staff Software Engineer - Apache Kafka

Remote (Poland) · staff

Kafka React Backend Development

Senior Security Engineer II

Remote (India) · senior

AWS GCP Azure

Staff Software Engineer I - Internal Access Management

Remote (United States) · staff

Kafka Security DevOps

Staff Product Manager, Confluent Cloud Kafka

Remote (Canada) · staff

React Kafka All Confluent jobs →

Job description

from Confluent careers

We’re not just building better tech. We’re rewriting how data moves and what the world can do with it. With Confluent, data doesn’t sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them.

It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together.

One Confluent. One Team. One Data Streaming Platform.

About the Role:

Confluent Cloud processes millions of events per second across AWS, GCP, and Azure. When incidents happen in a multi-cloud streaming platform, they happen at scale—data in motion, exactly-once semantics, and cascading failure modes that require deep systems thinking. We need an expert-level engineer who can drive proactive reliability improvements that prevent these incidents before they occur.

This role combines hands-on technical work with strategic program ownership. You'll spend roughly 75% of your time on engineering: building automation, improving tooling, analyzing systemic failure patterns, and designing reliability improvements. The remaining 25% is teaching and coordination: coaching teams through post-mortems, training incident commanders, and evolving our incident response practices.

This is an excerpt. Read the full job description on Confluent careers →

All software engineering jobs software engineering salaries software engineering career path

All Confluent Jobs Browse software engineering roles staff positions

Staff Site Reliability Engineer - Incident Management & Reliability (Remote - Canada)

About this role

More roles at Confluent

Job description

About the Role: