Forward Deployed Engineer, AI Inference (vLLM and Kubernetes)
Red Hat · Remote (US WA)
About this role
Red Hat is hiring a mid-level Field Engineer in the software engineering function as a remote position. The posting calls out experience with Kubernetes, Linux, Python, Terraform. Compensation is listed at $184,940–$342,490 per year.
- Role
- Field Engineer
- Function
- software engineering
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Remote (US WA)
- Work mode
- Remote
- Posted
- May 19, 2026
More roles at Red Hat
Job description
from Red Hat careersThe vLLM and LLM-D Engineering team at Red Hat is looking for a customer obsessed developer to join our team as a Forward Deployed Engineer. In this role, you will not just build software; you will be the bridge between our cutting-edge inference platform (LLM-D, and vLLM) and our customers' most critical production environments.
You will interface directly with the engineering teams at our customer to deploy, optimize, and scale distributed Large Language Model (LLM) inference systems. You will solve "last mile" infrastructure challenges that defy off-the-shelf solutions, ensuring that massive models run with low latency and high throughput on complex Kubernetes clusters. This is not a sales engineering role, you will be part of the core vLLM and LLM-D engineering team.
What You Will Do
Orchestrate Distributed Inference: Deploy and configure LLM-D and vLLM on Kubernetes clusters. You will set up and configure advanced deployment like disaggregated serving, KV-cache aware routing, KV Cache offloading etc to maximize hardware utilization.
Optimize for Production: Go beyond standard deployments by running performance benchmarks, tuning vLLM parameters, and configuring intelligent inference routing policies to meet SLOs for latency and throughput. You care about Time Per Output Token (TPOT), GPU utilization, GPU networking optimizations, and Kubernetes scheduler efficiency.
This is an excerpt. Read the full job description on Red Hat careers →