Research Engineer, Frontier Red Team (Autonomy)
Anthropic · San Francisco, CA · AI Research & Engineering
About this role
Anthropic is hiring a mid-level Research Scientist in the machine learning function based in San Francisco, CA. The posting calls out experience with Python, LLMs, Reinforcement Learning, Security. Compensation is listed at $320,000–$850,000 per year.
- Role
- Research Scientist
- Function
- machine learning
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- San Francisco, CA
- Department
- AI Research & Engineering
More roles at Anthropic
Job description
from Anthropic careersAbout Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the Team
The Frontier Red Team (FRT) is a small, focused technical research team within Anthropic's Policy organization. Our goal is to make the entire world safer in this era of advanced AI by understanding what these systems can do and building the defenses that matter.
In 2026, we're focused on researching and ensuring safety with self-improving, highly autonomous AI systems—especially ones with cyberphysical capabilities. See our previous related work on cyberdefense, robotics, and Project Vend. We'll also be collaborating closely with the Emerging Risks workstream to understand novel, societal-scale risks that arise when agents interface with the external world. This is early-stage, high-conviction research with the potential for outsized impact.
Note: We are exclusively hiring in SF. We support relocation, but all hires must relocate before starting.
About the Role
Our team is focused on a critical question: how do we defend against a world where powerful, autonomous, self-improving AI systems may be used adversarially?