Senior Manager, Cloud Platform & Site Reliability
Baseten · San Francisco, CA · EPD
About this role
Baseten is hiring a senior-level Operations Manager based in San Francisco, CA. The posting calls out experience with Kubernetes, Terraform, Pulumi, Helm. Compensation is listed at $165,000–$330,000 per year.
- Role
- Operations Manager
- Function
- operations
- Level
- senior
- Track
- Individual contributor
- Employment
- Full-time
- Location
- San Francisco, CA
- Department
- EPD
- Posted
- May 17, 2026
More roles at Baseten
Job description
from Baseten careersABOUT BASETEN
Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.
THE ROLE
As Senior Manager of Cloud Platform and Site Reliability, you will lead and grow the org responsible for the infrastructure that powers Baseten's machine learning platform. This is a manager-of-managers role: you will lead team leads across our Cloud Platform and Site Reliability Engineering functions, setting the technical direction, defining reliability standards, and building the organizational muscle to scale our infrastructure alongside the product.
You will own the end-to-end health of our cloud infrastructure and SRE practice — from coaching your leads through complex incident response and enterprise customer escalations, to shaping the multi-year roadmap for multi-cloud capacity, GPU inference infrastructure, and observability platforms. You operate at the intersection of people, strategy, and systems: you know how to build and develop strong teams, hold a high bar for engineering excellence, and make principled tradeoffs between long-term investment and short-term operational demands.