Infrastructure Vendor Ops Manager
Together AI · San Francisco, CA · Business Operations
About this role
Together AI is hiring an IT Manager at the manager level in its operations function, based in San Francisco, CA. The posting calls out experience with Jira, data structures, and cloud computing.
- Role: IT Manager
- Function: operations
- Level: manager
- Track: Management
- Employment: Full-time
- Location: San Francisco, CA
- Department: Business Operations
Job description
from Together AI careers
About The Role
Together AI is scaling its GPU infrastructure rapidly, working with a growing network of compute suppliers. As we expand, we need someone who owns the operational and financial accountability layer of our vendor relationships: tracking SLA compliance, managing credits, auditing invoices, and ensuring every dollar we spend on compute is accurate and accounted for.
This role sits within the Infrastructure Strategy team and is highly cross-functional, working with infrastructure engineering, finance, and go-to-market teams. When incidents happen, our engineering team produces root-cause analyses; your job is to take that technical detail, build an airtight case for credit claims, and negotiate directly with providers until credits are recovered. You will also partner with GTM and finance to assess the downstream impact of service disruptions and inform how we handle customer-facing commitments. This requires someone with sharp attention to detail, comfort navigating technical documentation, and the persistence to hold vendors accountable.
Responsibilities
- SLA tracking and credit recovery across all GPU compute and data center suppliers, including monitoring uptime and performance commitments, documenting violations, and driving credit claims to resolution
- Invoice review and validation for compute infrastructure contracts, flagging discrepancies and resolving billing issues directly with vendors
- Regular audits of vendor contracts and SLA performance to verify accuracy of charges and identify cost recovery opportunities