mid Machine Learning Engineer ic · Posted May 19, 2026

About this role

Red Hat is hiring a mid-level Machine Learning Engineer based in Toronto - MSO. The posting calls out experience with Data Structures, Machine Learning, Python, Kubernetes.

Role
Machine Learning Engineer
Function
machine learning
Level
mid
Track
Individual contributor
Employment
Full-time
Location
Toronto - MSO
Posted
May 19, 2026

More roles at Red Hat

Product Security Engineer
Remote (US DC) · mid
Python AWS Azure
Security Community and Compliance Architect (Czech Republic)
Brno - Tech Park Brno - B · mid
Kubernetes Docker CI/CD
Software Engineer - Ecosystem Engineering
Raanana, Israel · mid
Python Java Go
Associate Software Engineer - Ecosystem Engineering
Raanana, Israel · junior
Python Java Go
Consultant
Mumbai, India · mid
Kubernetes Terraform Ansible
All Red Hat jobs →

Job description

from Red Hat careers

Job Summary

At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. Red Hat AI Inference team accelerates AI for the enterprise and brings operational simplicity to GenAI deployments. As leading developers, maintainers of the vLLM project, and inventors of state-of-the-art techniques for model quantization and sparsification, our team provides a stable platform for enterprises to build, optimize, and scale LLM deployments.

As a Machine Learning Engineer focused on model optimization algorithms, you will work closely with our product and research teams to develop SOTA deep learning software. You will collaborate with our technical and research teams to develop LLM training and deployment pipelines, implement model compression algorithms, and productize deep learning research. If you are someone who enjoys bridging research and production, optimizing large models, and contributing to open-source AI tooling, this role is for you.

Join us in shaping the future of AI!

What you will do

  • Contribute to the design, development, and testing of various inference optimization algorithms in the LLM-compressor, Speculators, and vLLM projects. 

  • Design, implement, and optimize model compression pipelines using techniques such as quantization and pruning.

    This is an excerpt. Read the full job description on Red Hat careers →
All machine learning jobs machine learning salaries machine learning career path
All Red Hat Jobs Browse machine learning roles mid positions