Research Engineer / Scientist, Frontier Red Team (Cyber)
Anthropic · San Francisco, CA · AI Research & Engineering
About this role
Anthropic is hiring a mid-level Research Scientist in the machine learning function based in San Francisco, CA. The posting calls out experience with Python, LLMs, Security, OpenAI. Compensation is listed at $320,000–$485,000 per year.
- Role
- Research Scientist
- Function
- machine learning
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- San Francisco, CA
- Department
- AI Research & Engineering
More roles at Anthropic
Job description
from Anthropic careersAbout Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the Team
The Frontier Red Team (FRT) is a small, focused technical research team within Anthropic's Policy organization. Our goal is to make the entire world safer in an era of advanced AI by understanding what these systems can do and building the defenses that matter.
In 2026, we're focused on researching and ensuring safety with self-improving, highly autonomous AI systems, especially ones related to cyberphysical capabilities. See our previous related work on exploits, partnering with Mozilla, and zero days. This is early-stage, high-conviction research with the potential for outsized impact — Glasswing is one example.
Note: We are exclusively hiring in SF. We support relocation, but all hires must relocate before starting.
About the Role
In the last year, we've seen compelling signs that LLMs and agents are increasingly capable of novel cyber capabilities. We think 2026 will be the year where models reach expert-level, even superhuman, in several cybersecurity domains. This is a novel and massive threat surface.