Machine Learning Research Engineer Co-op
Red Hat · Boston, MA
About this role
Red Hat is hiring a mid-level Machine Learning Engineer based in Boston, MA. The posting calls out experience with Python, Rust, C, LLMs.
- Role
- Machine Learning Engineer
- Function
- machine learning
- Level
- mid
- Track
- Individual contributor
- Employment
- Full-time
- Location
- Boston, MA
- Posted
- Jun 3, 2026
More roles at Red Hat
Job description
from Red Hat careersJob Summary
At Red Hat we believe the future of AI is open and we are on a mission to bring the power of open-source LLMs and distributed LLM inference to every enterprise. We are seeking a highly motivated research intern to join our Machine Learning Research Team. As a research intern, you will work on cutting-edge networking techniques for ML workloads and contribute to research and engineering efforts that make distributed LLM inference faster, efficient, and more accessible. This is an exciting opportunity to gain hands-on experience in applied networking for ML while working with leading experts in the field.
Responsibilities
Research via experimentation and theoretical modeling the network bandwidth requirements and trade-offs in Prefill-Decode (P/D) disaggregated LLM serving.
Research and implement networking techniques/methods for high-performance KV cache transfers in deployment setups without RDMA networking.
Conduct experiments to evaluate the impact of newly developed non-RDMA KV Cache transfer techniques on performance (latency and throughput) in P/D LLM serving.
Collaborate with researchers and engineers to integrate the networking techniques/methods into real-world distributed inference workflows (e.g. in llm-d)
Document findings and contribute to technical reports, research theses, blog posts, or research publications.
Requirements
Currently pursuing a Masters (with research) or Ph.D. degree in Computer Science, Electrical Engineering, Machine Learning, or a related field.
This is an excerpt. Read the full job description on Red Hat careers →