AI/ML Data Engineer
Marvell · Santa Clara, CA
About this role
Marvell is hiring a mid-level Data Engineer based in Santa Clara, CA. The posting calls out experience with JavaScript, TypeScript, SQL, and React.
- Role: Data Engineer
- Function: data engineering
- Level: mid
- Track: Individual contributor
- Employment: Full-time
- Location: Santa Clara, CA
- Posted: May 19, 2026
Job description
from Marvell careers
About Marvell
Marvell’s semiconductor solutions are the essential building blocks of the data infrastructure that connects our world. Across enterprise, cloud and AI, and carrier architectures, our innovative technology is enabling new possibilities.
At Marvell, you can affect the arc of individual lives, lift the trajectory of entire industries, and fuel the transformative potential of tomorrow. For those looking to make their mark on purposeful and enduring innovation, above and beyond fleeting trends, Marvell is a place to thrive, learn, and lead.
Your Team, Your Impact
Embedded within the AI/ML team, this role owns the data engineering layer that powers both Gen AI applications and ML model development. You will be responsible for building production-grade pipelines, curating AI-ready datasets for LLMs and ML models, and contributing to front-end interfaces when required, so that the team can deliver complete, data-driven AI products without external dependencies.
What You Can Expect
Key Responsibilities
Architect and deliver production-grade ELT/ETL pipelines across Databricks and Snowflake for ML training, validation, and inference workflows
Build and maintain AI-ready datasets optimized for both ML model consumption and Gen AI use cases — clean, versioned, and reproducible
Curate and structure high-quality datasets for RAG pipelines and embedding generation; design document chunking strategies, metadata schemas, and grounding data layers that directly improve retrieval accuracy and Gen AI application performance
This is an excerpt. Read the full job description on Marvell careers →