Research Engineer, Reward Models Platform
Anthropic · Remote | San Francisco, CA | Seattle, WA | New York City, NY · AI Research & Engineering
About this role
Anthropic is hiring a mid-level Research Scientist in the machine learning function for a remote position. The posting calls out experience with Python, Kubernetes, Spark, and reinforcement learning. Compensation is listed at $350,000–$500,000 per year.
- Role: Research Scientist
- Function: Machine learning
- Level: Mid
- Track: Individual contributor
- Employment: Full-time
- Location: Remote | San Francisco, CA | Seattle, WA | New York City, NY
- Work mode: Remote
- Department: AI Research & Engineering
Job description
from Anthropic careers
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
You will deeply understand the research workflows of our Finetuning teams and automate the high-friction parts, turning days of manual experimentation into hours. You’ll build the tools and infrastructure that enable researchers across the organization to develop, evaluate, and optimize reward signals for training our models. The scalable platforms you build will make it easy to experiment with different reward methodologies, assess their robustness, and iterate rapidly on improvements, helping the rest of Anthropic train our reward models.