Senior Principal Machine Learning Engineer, vLLM
Red Hat · Boston, MA
About this role
Red Hat is hiring a senior-level Machine Learning Engineer based in Boston, MA. The posting calls out experience with Kubernetes, Linux, data structures, and machine learning. Compensation is listed at $206,600–$351,050 per year.
- Role: Machine Learning Engineer
- Function: machine learning
- Level: senior
- Track: Tech leadership
- Employment: Full-time
- Location: Boston, MA
- Posted: May 19, 2026
Job description
from Red Hat careers
Job Summary
At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat AI Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.
As a Senior Principal Machine Learning Engineer focused on model optimization algorithms, you will work closely with our product and research teams to develop SOTA deep learning software. You will collaborate with our technical and research teams to develop LLM training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you are someone who wants to contribute to solving challenging technical problems at the forefront of deep learning in the open source way, this is the role for you.
Join us in shaping the future of AI!
What you will do
Contribute to the design, development, and testing of various inference optimization algorithms in vLLM and related projects, such as llm-d, LLM-compressor, and speculators.
This is an excerpt of the full job description on Red Hat careers.