About this role

Red Hat is hiring a mid-level Machine Learning Engineer based in Toronto - MSO. The posting calls for experience with data structures, machine learning, Python, and Kubernetes.

Role: Machine Learning Engineer
Function: machine learning
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Toronto - MSO
Posted: May 19, 2026

Job description

from Red Hat careers

Job Summary

At Red Hat, we believe the future of AI is open, and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. The Red Hat AI Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers and maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.

As a Machine Learning Engineer focused on model optimization algorithms, you will work closely with our product, technical, and research teams to develop state-of-the-art (SOTA) deep learning software. Together you will develop LLM training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you enjoy bridging research and production, optimizing large models, and contributing to open-source AI tooling, this role is for you.

Join us in shaping the future of AI!

What you will do

Contribute to the design, development, and testing of various inference optimization algorithms in the LLM-compressor, Speculators, and vLLM projects.
Design, implement, and optimize model compression pipelines using techniques such as quantization and pruning.

This is an excerpt of the full job description posted on Red Hat careers.