AI Engineer · machine learning · mid · individual contributor · 3+ yrs

About this role

Cerebras Systems is hiring a mid-level AI Engineer in the machine learning function, based in Toronto, Canada. The posting calls for experience with Python, C, C#, and LLMs, plus 3+ years of relevant work.

Role: AI Engineer
Function: machine learning
Level: mid
Track: Individual contributor
Employment: Full-time
Location: Toronto, Canada
Experience: 3+ years
Department: Software
AI Summary
Prototype and benchmark cutting-edge ML innovations on wafer-scale hardware, developing performance evaluation pipelines and agent-driven automation. Requires 3+ years building high-performance ML/systems software with strong Transformer math knowledge and full AI toolchain proficiency.

Job description

from Cerebras Systems careers

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Cerebras' current customers include top model labs, global enterprises, and cutting-edge AI-native startups. OpenAI recently announced a multi-year partnership with Cerebras to deploy 750 megawatts of scale, transforming key workloads with ultra-high-speed inference.

Thanks to the groundbreaking wafer-scale architecture, Cerebras Inference offers the fastest Generative AI inference solution in the world, over 10 times faster than GPU-based hyperscale cloud inference services. This order of magnitude increase in speed is transforming the user experience of AI applications, unlocking real-time iteration and increasing intelligence via additional agentic computation.

About The Role

Join the inference model team dedicated to bringing up state-of-the-art models and to numerically validating and accelerating new model ideas on wafer-scale hardware. You will prototype architectural tweaks, build performance-eval pipelines, and turn hard numbers into changes that land in production.

This is an excerpt. The full job description is on Cerebras Systems careers.