mid operations Systems Engineer ic 7+ yrs · Posted Oct 31, 2024
$295,000 – $445,000
USD per year

About this role

OpenAI is hiring a mid-level Systems Engineer in the operations function based in San Francisco, CA. The posting calls out experience with Python, SQL, Bash, pandas and roughly 7+ years of relevant work. Compensation is listed at $295,000–$445,000 per year.

Role
Systems Engineer
Function
operations
Level
mid
Track
Individual contributor
Employment
Full-time
Location
San Francisco, CA
Experience
7+ years
Department
Scaling
Posted
Oct 31, 2024
AI Summary
Develop and implement power management solutions for large-scale supercomputers, building automation to monitor consumption patterns and stabilize grid reliability. Requires 7+ years software engineering experience, strong Python proficiency, distributed systems knowledge, and electrical engineering concepts understanding.

More roles at OpenAI

Research Engineer
San Francisco, CA · mid
Deep Learning Distributed Systems Data Structures
Account Director - Tokyo
Tokyo, Japan · director
OpenAI
Research Engineer, Retrieval & Search, Applied Engineering
San Francisco, CA · mid
Data Structures API Development Machine Learning
Researcher, Robustness & Safety Training
San Francisco, CA · mid
Deep Learning Machine Learning OpenAI
Software Engineer, Data Infrastructure
San Francisco, CA · mid
Machine Learning Performance Optimization Data Analytics
All OpenAI jobs →

Job description

from OpenAI careers

About the Team

The Frontier Systems team at OpenAI builds, launches, and supports the largest supercomputers in the world that OpenAI uses for its most cutting edge model training.

We take data center designs, turn them into real, working systems and build any software needed for running large-scale frontier model trainings.

Our mission is to bring up, stabilize and keep these hyperscale supercomputers reliable and efficient during the training of the frontier models.

About the Role

As a Software Engineer on the Frontier Systems team focused on power management, you will work on critical infrastructure to support cutting-edge research. With large-scale supercomputers consuming substantial amounts of power, managing this efficiently is key to maximizing computational capacity.  This role is critical to ensuring that our cutting-edge research supercomputing infrastructure runs smoothly, while maintaining reliability and grid-level power stability.

Our team empowers strong engineers with a high degree of autonomy and ownership, as well as ability to effect change. This role will require a keen focus on system-level comprehensive investigations and the development of automated solutions. We want people who go deep on problems, investigate as thoroughly as possible, and build automation for detection and remediation at scale. 

In this role, you will:

This is an excerpt. Read the full job description on OpenAI careers →
All operations jobs operations in San Francisco, CA Jobs in San Francisco, CA operations salaries operations career path
All OpenAI Jobs Browse operations roles mid positions