🎓 Claude Code Masterclass | Learn AI-assisted development on Udemy — plus the companion book on Leanpub & Amazon. Start Learning

ZenML and Kitaru at KubeCon EU 2026: Durable AI Agents

Caught up with Safoine Khabich from ZenML at KubeCon EU 2026. Five years of enterprise ML pipeline experience channeled into Kitaru — an open-source.

April 10, 2026 · 3 min read

The hype around AI agents is everywhere, but how do we actually make them durable, secure, and production-ready?

I had a fantastic time catching up with Safoine Khabich from the ZenML team at KubeCon to dive into exactly this.

From ML Pipelines to Agentic Workflows

We talked about the massive industry shift towards “agentic workflows” and how developers are moving beyond simple models to building complex, autonomous agents.

Safoine walked me through how they are addressing this by taking five years of enterprise ML pipeline experience and channeling it into their open-source ecosystem. Alongside ZenML for ML pipelines, they have introduced Kitaru — a framework specifically designed to build durable AI agents that you can deploy anywhere.

The evolution makes sense:

ML pipelines (ZenML’s foundation) — orchestrating training, evaluation, and deployment workflows
Inference serving — deploying models as APIs with proper model profiles and autoscaling
Agentic workflows (Kitaru) — autonomous agents that chain reasoning, tool use, and human feedback in production loops

Each phase builds on the last. You cannot build reliable agents without reliable pipelines underneath, and you cannot scale agents without proper inference infrastructure.

The Human-in-the-Loop Challenge

What really stood out in our chat was the focus on the “human-in-the-loop.” Building an AI agent is one thing, but building one that is durable, secure, and can seamlessly integrate human feedback in a real-world production environment is the true challenge.

In practice, this means:

Approval gates — agents that pause and request human sign-off before high-impact actions
Feedback loops — capturing human corrections to improve agent behavior over time
Audit trails — every agent decision traceable back to its reasoning chain and the data it used
Graceful degradation — when the agent is uncertain, it escalates rather than guessing
Security boundaries — agents that respect permission scopes and cannot escalate their own access

This connects to what I discussed with Clemens Scholz about AI as a digital companion. The best AI systems augment human judgment rather than replacing it. Kitaru’s human-in-the-loop approach is how you build that trust in enterprise environments.

Open Source All the Way

The fact that they are building these foundational tools completely open source is a massive win for the community. In the ML tooling space, open source matters because:

No vendor lock-in — you can run ZenML and Kitaru on any cloud or on-premises
Transparency — you can audit exactly what the framework does with your data and models
Community-driven — practitioners shape the roadmap based on real production needs
Composability — integrates with the broader cloud native ecosystem including Kubernetes and existing CI/CD pipelines

The Bigger Picture

The shift from “model serving” to “agent orchestration” is one of the defining trends of 2026. At KubeCon, I saw this theme everywhere — from Rootly’s AI SRE agents to Dynatrace’s observability for AI workloads to ZenML’s durable agent framework. The infrastructure layer is maturing to support not just inference, but autonomous reasoning loops.

Learn More

If you are building ML pipelines or experimenting with AI agents, check out their work: zenml.io

About the Author

I am Luca Berton, AI and Cloud Advisor. I help enterprises move from AI experiments to production-grade agentic workflows. Book a consultation.

Related Articles

Embodied AI infrastructure for robotics and physical-world intelligence

Embodied AI Infrastructure for the Physical World

Embodied AI moves artificial intelligence from screens into machines. Here is the infrastructure, data and safety stack required to make it work.

Analytics screenshot showing a traffic spike on ansiblepilot.com

Is Your Website Ready for AI Agents?

AI agents will not browse websites like humans. Here is how to prepare your website, database and infrastructure for autonomous traffic.

AI governance dashboard showing findings remediation status and agent identity controls

AI Governance in Practice: Findings Remediation and Agent Identity

What separates companies that manage AI risk well from those that just publish a policy PDF: fast findings remediation and real agent identity controls.

Calendar view of scoping events and mock sessions in an enterprise Copilot assessment delivery pipeline

What Delivering Enterprise Copilot Assessments Actually Looks Like

Behind the scenes of enterprise Copilot rollouts: scoping events, mock sessions, and the delivery cadence that decides if a program is seen as working.

Need expert guidance?

Free 30-min consultation with Luca Berton

Free 30-min AI & Cloud consultation