Infrastructure Engineer
Dataiku · France, Paris; France, Remote; Germany, Berlin - Remote; Netherlands, Remote; United Kingdom, London; United Kingdom, Remote · Engineering
Dataiku is the Platform for AI Success, the enterprise orchestration layer for building, deploying, and governing AI. In a single environment, teams design and operate analytics, machine learning, and AI agents with the transparency, collaboration, and control enterprises require. Sitting above data platforms, cloud infrastructure, and AI services, Dataiku connects the full enterprise AI stack — empowering organizations to run AI across multi-vendor environments with centralized governance.
The world’s leading companies rely on Dataiku to operationalize AI and run it as a true business performance engine delivering measurable value. For more, visit the Dataiku blog, LinkedIn, X, and YouTube.
How you’ll make an impact
At Dataiku, our mission is to enable customers to bring large-scale data analytics and AI technologies into a centralized, easy-to-use platform. To support this mission, we are looking for an Infrastructure Engineer to help operate, maintain, and troubleshoot our internal and customer-facing infrastructure.
You will work closely with experienced infrastructure and platform engineers, contributing to the reliability and day-to-day operations of our systems. This role is hands-on and operationally focused, with a strong emphasis on UNIX/Linux systems and cloud infrastructure.
Our infrastructure primarily runs on AWS, with some components on Azure and GCP. The tooling environment includes Terraform, Ansible, Kubernetes, and Python, though you do not need deep expertise in all of these when you start.
What you’ll work on
- Operate, maintain, and troubleshoot UNIX/Linux systems running in cloud environments
- Support and maintain existing configuration management and Infrastructure as Code setups
- Assist with the operation of cloud-based infrastructure, including virtual machines, networking components, and managed services
- Help monitor system health and performance, investigate alerts, and participate in incident response and root cause analysis
- Perform routine infrastructure updates and maintenance to ensure systems remain secure, reliable, and up to date
- Support Kubernetes clusters and containerized workloads, primarily from an operational and troubleshooting perspective
- Collaborate with senior engineers to improve automation, monitoring, and operational practices
- Document procedures, operational runbooks, and troubleshooting steps to improve team efficiency
What you need to be successful
- Experience working with UNIX/Linux systems, including hands-on troubleshooting and shell scripting
- Understanding of networking fundamentals (TCP/IP, DNS, routing, firewalls, load balancing) in cloud or data-center environments
- Basic experience operating infrastructure in a cloud environment (preferably AWS), including compute, networking, and monitoring services
- Basic scripting or development experience (e.g., Python)
- Clear communication skills and a collaborative, respectful approach to working with teammates
- Willingness to learn, ask questions, and grow technical depth over time
Nice to have
- Exposure to Infrastructure as Code tools such as Terraform
- Familiarity with at least one configuration management or automation tool (e.g., Ansible, Chef, Puppet, SaltStack)
- Familiarity with Kubernetes or container-based environments
- Experience with monitoring tools such as Grafana or similar platforms
- Ability to investigate incidents, follow runbooks, and escalate appropriately when needed
- Interest in automation and reliability, even if you have not yet designed large-scale systems yourself