Machine Learning Systems Research Intern, PhD, Summer 2026
Red Hat · Boston, MA
About this role
Red Hat is hiring a intern-level Research Scientist in the machine learning function based in Boston, MA. The posting calls out experience with Python, CUDA, PyTorch, LLMs.
- Role
- Research Scientist
- Function
- machine learning
- Level
- intern
- Track
- Individual contributor
- Employment
- Internship
- Location
- Boston, MA
- Posted
- May 15, 2026
More roles at Red Hat
Job description
from Red Hat careersJob Summary
At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and vLLM to every enterprise. We are seeking a highly motivated summer intern to join our Machine Learning Research Team. As an intern, you will work on cutting-edge AI inference and model optimization techniques, and contribute to research and engineering efforts that make LLMs faster and more efficient. This is an exciting opportunity to gain hands-on experience in applied machine learning research while working with leading experts in the field.
Responsibilities
Research and implement techniques for LLM inference and LLM optimizations.
Conduct experiments to evaluate the impact of optimization methods on model accuracy, latency, and throughput.
Collaborate with researchers and engineers to integrate optimizations into real-world machine learning workflows.
Document findings and contribute to technical reports, blog posts, or research publications.
Requirements
Currently pursuing a Ph.D. degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
Strong programming skills in C++, CUDA, and Python.
Experience with tensor math libraries such as PyTorch.
Familiarity with AI model optimization techniques such as quantization (e.g., INT4, FP8), pruning, and knowledge distillation.
Deep understanding and experience in GPU performance optimizations.
This is an excerpt. Read the full job description on Red Hat careers →