Engineering Manager (AI Inference)
Perplexity · San Francisco, CA · AI
About this role
Perplexity is hiring a manager-level Engineering Manager in the software engineering function based in San Francisco, CA. The posting calls out experience with CUDA, Kubernetes, TensorFlow, PyTorch and roughly 5+ years of relevant work. Compensation is listed at $300,000–$485,000 per year.
- Role
- Engineering Manager
- Function
- software engineering
- Level
- manager
- Track
- hybrid
- Employment
- Full-time
- Location
- San Francisco, CA
- Experience
- 5+ years
- Department
- AI
- Posted
- Apr 13, 2026
More roles at Perplexity
Job description
from Perplexity careersAbout the Role
We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products and APIs, serving millions of users with state-of-the-art AI capabilities.
You will own the technical direction and execution of our inference systems while building and leading a world-class team of inference engineers. Our current stack includes Python, PyTorch, Rust, C++, and Kubernetes. You will help architect and scale the large-scale deployment of machine learning models behind Perplexity's Comet, Sonar, Search, Deep Research products.
Why Perplexity?
Build SOTA systems that are the fastest in the industry with cutting-edge technology
High-impact work on a smaller team with significant ownership and autonomy
Opportunity to build 0-to-1 infrastructure from scratch rather than maintaining legacy systems
Work on the full spectrum: reducing cost, scaling traffic, and pushing the boundaries of inference
Direct influence on technical roadmap and team culture at a rapidly growing company
Responsibilities
Lead and grow a high-performing team of AI inference engineers
Develop APIs for AI inference used by both internal and external customers
Architect and scale our inference infrastructure for reliability and efficiency
Benchmark and eliminate bottlenecks throughout our inference stack
This is an excerpt. Read the full job description on Perplexity careers →