Skip to main content
🎓 Claude Code Masterclass Learn AI-assisted development on Udemy — plus the companion book on Leanpub & Amazon. Start Learning
Luca Berton speaking at KubeCon Europe 2026
Conferences

Speaking at KubeCon Europe 2026

Luca Berton presents 'Lessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI with NVIDIA KAI (G/H200)' at KubeCon + CloudNativeCon Europe 2026 in.

LB
Luca Berton
· 4 min read

I’m Speaking at KubeCon Europe 2026!

I’m thrilled to announce that I’ll be presenting at KubeCon + CloudNativeCon Europe 2026 in Amsterdam! This is the world’s largest and most important Kubernetes conference, and being selected as a speaker is an incredible honour. Out of thousands of submissions, only a fraction make the cut — and I’m proud to bring real-world production experience to the stage.


Session Details

DetailInformation
EventKubeCon + CloudNativeCon Europe 2026
TitleLessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI with NVIDIA KAI (G/H200)
DateTuesday, March 24, 2026
Time16:15 – 16:45 CET
LocationRAI Amsterdam — Hall 8, Room F
LevelIntermediate
TrackAI + ML
AffiliationDell Technologies

Talk: Lessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI with NVIDIA KAI (G/H200)

Abstract

How do you run shared, production-grade GPUs for AI/ML safely and efficiently? This experience report distils hard-won lessons from implementing multi-tenant GPU orchestration on OpenShift AI using NVIDIA KAI on G/H200 hardware, fronted by Traefik and backed by Dell Technologies platforms.

What You’ll Learn

  • Tenant Isolation Patterns: Namespaces, quotas, and priority classes for safe multi-tenancy
  • Heterogeneous Node Scheduling: Placing workloads intelligently across mixed GPU types
  • MIG vs. Full-GPU Trade-offs: When to partition and when to dedicate entire GPUs
  • Throughput vs. Latency Tuning: Optimising for batch training vs. real-time inference
  • Driver & Firmware Pitfalls: Lessons from the trenches on GPU driver lifecycle
  • Upgrade & Rollback Strategies: Zero-downtime GPU stack updates
  • Day-2 Operations: Observability, autoscaling, and chargeback for GPU compute
  • Practical Manifests & Guardrails: Ready-to-use configurations you can apply immediately

Why This Talk Matters for the Ecosystem

GPU compute is the most expensive and contested resource in modern IT. As every organisation races to deploy AI/ML workloads, the Kubernetes ecosystem faces a critical challenge: how do you share GPU infrastructure safely, efficiently, and at scale?

This talk matters because:

For the Cloud-Native Community

  • GPU multi-tenancy is largely uncharted territory — most Kubernetes documentation and tooling assume CPU-based workloads. This talk fills a real knowledge gap with production-tested patterns.
  • NVIDIA KAI on Kubernetes is still new — attendees get early, honest insights into what works and what breaks.
  • OpenShift AI + Traefik + Dell is a real enterprise stack — not a demo on a single node, but lessons from running at scale.

For AI/ML Teams

  • Reduce GPU waste: Learn isolation and scheduling patterns that push utilisation from 30% to 80%+.
  • Avoid costly mistakes: Driver/firmware issues can take down entire GPU pools — hear how to prevent them.
  • Budget transparency: Chargeback and observability patterns so teams know exactly what GPU time costs.

For Platform Engineers

  • Battle-tested manifests: Walk away with YAML you can deploy on Monday morning.
  • Upgrade without downtime: GPU driver and firmware lifecycle strategies that don’t require maintenance windows.
  • Autoscaling GPU nodes: Patterns for scaling expensive hardware responsibly.

This isn’t a theoretical talk — it’s an experience report from running production GPU workloads. Every recommendation comes from something that either worked brilliantly or failed spectacularly.


Join Me at KubeCon!

If you’re planning to attend KubeCon Europe 2026, I’d love to see you there!

👉 View Session on Sched


What Is KubeCon + CloudNativeCon?

KubeCon + CloudNativeCon is the flagship conference of the Cloud Native Computing Foundation (CNCF) — the vendor-neutral home of Kubernetes, Prometheus, Envoy, and 180+ other open-source projects that power modern infrastructure.

It is, quite simply, the most important event in cloud-native computing.

By the Numbers

MetricScale
Attendees12,000+ in-person + thousands virtual
Sessions300+ talks, workshops, and tutorials
Sponsors200+ companies from startups to hyperscalers
Co-located Events20+ including Istio Day, ArgoCon, Platform Engineering Day
Countries Represented100+

Why KubeCon Matters

KubeCon is where the future of infrastructure is decided. It’s where:

  • Kubernetes releases are previewed and discussed by the people who build them
  • New CNCF projects are announced (graduated, incubating, sandbox)
  • Enterprise adoption patterns are shared through real-world case studies
  • The AI/ML on Kubernetes movement is accelerating — GPU scheduling, model serving, MLOps pipelines
  • Platform engineering best practices are shaped by practitioners, not vendors

Being selected to speak at KubeCon means your work has been peer-reviewed by the program committee and deemed valuable to the global community. It’s the gold standard for cloud-native thought leadership.

Why Attend?

  • Technical Deep-Dives: From Kubernetes internals to AI/ML orchestration
  • Networking: Meet the maintainers, contributors, and adopters shaping the ecosystem
  • Sponsor Showcase: Explore the latest tools and platforms from 200+ companies
  • Hallway Track: Some of the best conversations happen between sessions
  • Career Growth: KubeCon speakers and attendees are at the forefront of the industry

Stay Updated

Want to follow my KubeCon journey? Subscribe and follow for updates:


Thank You

A huge thank you to the KubeCon program committee for selecting this talk. I’m looking forward to sharing our experiences and learning from the incredible cloud-native community!

See you in Amsterdam! 🇳🇱



Have questions about GPU orchestration or OpenShift AI? Drop a comment or reach out on social media — I’m always happy to chat about cloud-native infrastructure!

Free 30-min AI & Cloud consultation

Book Now