mid Machine Learning Engineer ic
$189,600 – $237,000
USD per year

About this role

Scale AI is hiring a mid-level Machine Learning Engineer based in San Francisco, CA | New York City, NY. The posting calls out experience with AWS, GCP, TensorFlow, PyTorch. Compensation is listed at $189,600–$237,000 per year.

Role
Machine Learning Engineer
Function
machine learning
Level
mid
Track
Individual contributor
Employment
Full-time
Location
San Francisco, CA | New York City, NY
Department
Research

More roles at Scale AI

Finance Systems & Automations Manager
San Francisco, CA · director
LLMs API Development Full Stack
Forward Deployed AI Engineer, Enterprise
San Francisco, CA | New York City, NY · mid
Python AWS GCP
Forward Deployed AI Engineering Manager, Enterprise
San Francisco, CA | New York City, NY · manager
Python AWS GCP
Forward Deployed AI Engineering Manager, GenAI Applications
Berlin, Germany | London, United Kingdom · manager
AWS GCP Azure
Forward Deployed Engineer, GenAI
San Francisco, CA | New York City, NY · mid
LLMs Reinforcement Learning Distributed Systems
All Scale AI jobs →

Job description

from Scale AI careers
About This Role

Lead applied ML engineering on Scale's Applied ML team, powering data infrastructure for leading agentic LLMs (ChatGPT, Gemini, Llama). You will build scalable multi-agent systems to validate agentic reasoning and behaviors, scale human expertise, and drive research into real-world agent reliability failures despite strong benchmarks, shipping production fixes.

Ideal for exceptional engineers with deep research rigor and a relentless focus on practical, high-impact systems. You will iterate rapidly with data, leverage AI tools to accelerate development, and collaborate tightly across engineering, product, and research.

If you excel at turning frontier agent research into reliable deployed systems, we want to hear from you.

You will:

  • Build and deploy multi-agent systems for agentic reasoning validation
  • Develop pipelines to detect errors and scale human judgment
  • Combine classical ML, LLMs, and multi-agent techniques for reliability
  • Lead research into agent failure modes and ship fixes
  • Use AI tools to speed prototyping and iteration
  • Build data-driven evaluations and deploy rapid improvements
  • Integrate systems into Scale's platform

Ideally You’ll Have:

  • PhD or MSc in Computer Science, Mathematics, Statistics, or related field
  • 3+ years shipping scaled production ML systems
  • Demonstrated real-world impact
  • Mastery of PyTorch, TensorFlow, JAX, or scikit-learn
  • Deep expertise in agentic LLMs and multi-agent systems
  • This is an excerpt. Read the full job description on Scale AI careers →
All machine learning jobs machine learning in San Francisco, CA Jobs in San Francisco, CA machine learning salaries machine learning career path
All Scale AI Jobs Browse machine learning roles mid positions