senior software engineering Cloud Engineer ic 5+ yrs
$160,000 – $230,000
USD per year

About this role

Together AI is hiring a senior-level Cloud Engineer in the software engineering function based in San Francisco, CA. The posting calls out experience with Go, AWS, GCP, Azure and roughly 5+ years of relevant work. Compensation is listed at $160,000–$230,000 per year.

Role
Cloud Engineer
Function
software engineering
Level
senior
Track
Individual contributor
Employment
Full-time
Location
San Francisco, CA
Experience
5+ years
Department
Engineering
AI Summary
Senior Backend Engineer building distributed GPU scheduling systems, global management planes, and customer-facing cloud services for Together AI's AI acceleration platform. Requires 5+ years designing fault-tolerant distributed systems, strong systems knowledge across compute/networking/storage, and expert-level programming in Golang or similar languages.

More roles at Together AI

Finance Analytics Engineer
San Francisco, CA · mid
Python SQL Snowflake
Forward Deployed Engineer (GPU Clusters)
San Francisco, CA · mid
Python Bash Kubernetes
Forward Deployed Engineer (Inference & Post-Training)
San Francisco, CA · mid
Python LLMs Reinforcement Learning
Infrastructure Accounting Manager
San Francisco, CA · manager
Networking Data Structures
Infrastructure Design Engineer
San Francisco, CA · mid
Airflow Networking Data Structures
All Together AI jobs →

Job description

from Together AI careers

About the Role

Together AI is building the AI Acceleration Cloud, an end-to-end platform for the full generative AI lifecycle, combining the fastest LLM inference engine with state-of-the-art AI cloud infrastructure.

As a Senior Backend Engineer, you will play a key role in building the next generation AI cloud platform – a highly available, global, blazing-fast cloud infrastructure that virtualizes cutting-edge ML hardware (GB200s/GB300s, BlueField DPUs) and enables state-of-the-art ML practitioners with self-serve AI cloud services, such as on-demand + managed Kubernetes and Slurm clusters. This platform serves both our internal StaaS products (inference, fine-tuning) and our external cloud customers, spanning dozens of data centers across the world.

Some of what you’ll work on:

  • Work on a distributed GPU scheduling system for the on-demand clusters product, Instant Clusters.
  • Build out a global management plane for managing our data center compute, networking, and storage.
  • Design and build new customer-facing cloud platform services, delivering killer enterprise AI cloud features.

Responsibilities

  • Identify, design, and develop foundational backend services that power Together’s cloud platform
  • Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure
  • Partner with product teams to understand functional requirements and deliver solutions that meet business needs
  • This is an excerpt. Read the full job description on Together AI careers →
All software engineering jobs software engineering in San Francisco, CA Jobs in San Francisco, CA software engineering salaries software engineering career path
All Together AI Jobs Browse software engineering roles senior positions