Software Engineer, Inference AI/ML
CoreWeave · Sunnyvale, CA | Bellevue, WA · Technology
$92,000 – $135,000
USD per year
About this role
CoreWeave is hiring a mid-level Software Engineer based in Sunnyvale, CA or Bellevue, WA. The posting calls out experience with Python, C, CUDA, and Spring. The listed education preference is a bachelor's degree or equivalent. Compensation is listed at $92,000–$135,000 per year.
- Role: Software Engineer
- Function: Software engineering
- Level: Mid
- Track: Individual contributor
- Employment: Full-time
- Location: Sunnyvale, CA | Bellevue, WA
- Education: Bachelor's degree
- Department: Technology
AI Summary
Mid-level software engineer implementing features for a GPU model-serving inference platform using Python, Go, and C++. Requires a CS/EE degree or equivalent practical experience, strong foundations in data structures and algorithms, and experience with containerization. The role focuses on latency, reliability, and cost improvements, with mentorship from experienced engineers.
Job description
From CoreWeave careers
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. Trusted by leading AI labs, startups, and global enterprises, CoreWeave combines superior infrastructure performance with deep technical expertise to accelerate breakthroughs and turn compute into capability. Founded in 2017, CoreWeave became a publicly traded company (Nasdaq: CRWV) in March 2025. Learn more at www.coreweave.com.
What You’ll Do:
Join the Inference team to ship production features that improve latency, reliability, and cost for model serving on our GPU platform. As an IC1, you’ll implement well-scoped changes, learn our operational practices, and grow quickly with mentorship from experienced engineers.
About the role:
- Implement well-scoped features and fixes in Python/Go/C++ for model-serving services (e.g., Triton, vLLM, TensorRT-LLM, Ray Serve).
- Write tests, code comments, and short design docs; participate in code reviews.
- Add basic metrics and dashboards; assist with alarms and runbooks.
- Follow on-call runbooks and learn incident response in a guided rotation.
- Contribute to performance experiments (e.g., request batching, concurrency, caching) with guidance.
Who You Are:
- BS/MS in CS, EE, or related field, or equivalent practical experience.
This is an excerpt. Read the full job description on CoreWeave careers →