Principal Engineer - Observability
CoreWeave · New York City, NY | Sunnyvale, CA · Technology
About this role
CoreWeave is hiring a principal-level Site Reliability Engineer in the software engineering function based in New York City, NY | Sunnyvale, CA. The posting calls out experience with Python, Rust, C, Spring. Compensation is listed at $206,000–$303,000 per year.
- Role
- Site Reliability Engineer
- Function
- software engineering
- Level
- principal
- Track
- Individual contributor
- Employment
- Full-time
- Location
- New York City, NY | Sunnyvale, CA
- Department
- Technology
More roles at CoreWeave
Job description
from CoreWeave careersWhat You’ll Do:
We are looking for a highly experienced and strategic Principal Engineer, Observability to lead the architecture, development, and operations of our Observability platform. In this role, you will define how customers monitor, troubleshoot, and operate their AI workloads at scale on CoreWeave.
You will work directly with customers and partner closely with engineering leaders across multiple teams to drive a unified Observability experience across CoreWeave products—spanning metrics, logs, traces, and customer-facing insights.
About the role:
- Lead the Observability strategy and roadmap, ensuring clear alignment with business goals, product direction, and performance/SLA objectives
- Design and implement low-latency, high-scale telemetry pipelines and data stores solutions that power observability across all CoreWeave products
- Build customer-facing experiences—dashboards, alerts, and workflows—that enable rapid troubleshooting and deep insight into AI workloads and platform health