Skip to main content
๐ŸŽ“ Claude Code Masterclass Learn AI-assisted development on Udemy โ€” plus the companion book on Leanpub & Amazon. Start Learning
KubeCon + CloudNativeCon Europe 2026 ยท Amsterdam

Multi-tenant GPUs on
Bare Metal OpenShift AI

A GitOps Blueprint from the Trenches

by Luca Berton ยท AI & Cloud Advisor

๐ŸŽฌ Watch the Talk

๐ŸŽฏ 780 attendees saved this session in the KubeCon agenda

What This Talk Covers

Running GPU workloads on bare metal is hard. Sharing them across teams without chaos is harder. This talk shares a battle-tested framework for building a safe, fair, and efficient multi-tenant GPU platform โ€” everything managed through GitOps.

๐Ÿ›ก๏ธ

Safety

Blast radius = zero. RBAC, NetworkPolicy, quotas โ€” all in Git. One team can never break another.

โš–๏ธ

Fairness

Deterministic contention. PriorityClasses, preemption rules, and GPU caps โ€” no more "random wins."

โšก

Efficiency

Outcomes per GPU-hour. Time slicing, MIG, GPUDirect RDMA โ€” measured and optimized per tenant.

Key Topics

โœ“ GitOps-first with Argo CD + Kustomize โ€” everything auditable
โœ“ Bare metal bootstrap: Ansible โ†’ GitOps โ†’ Day 2 ops
โœ“ SR-IOV with NVIDIA Network Operator vs OpenShift SR-IOV
โœ“ Open kernel modules + DMA-BUF for safer GPU upgrades
โœ“ Multi-tenant networking: HAProxy + Keepalived + per-tenant VIPs
โœ“ KAI Scheduler for GPU-aware scheduling + visibility
โœ“ Per-tenant monitoring, chargeback, and self-service dashboards
โœ“ 7 guardrails you can apply today

The Stack

OpenShift AI NVIDIA H200 GPU Operator Network Operator SR-IOV Argo CD Kustomize HAProxy Keepalived Run:AI KAI Scheduler GPUDirect RDMA DMA-BUF InfiniBand Dell PowerEdge PowerScale NAS Ansible Quay Prometheus GitOps

About the Speaker

Luca Berton is an AI & Cloud Advisor with 15+ years building enterprise infrastructure. Author of 8 books including Kubernetes Recipes, creator of the 40,000+ student learning community, and hands-on consultant for GPU platforms, AI integration, and platform engineering.

Based in Amsterdam ๐Ÿ‡ณ๐Ÿ‡ฑ ยท Previously at Red Hat, Allianz, Databricks

Let's Talk GPU Infrastructure

Building a multi-tenant GPU platform? Migrating to bare metal AI? Considering OpenShift AI?

I'm in Amsterdam for KubeCon โ€” let's grab a coffee and talk shop.

Or find me at the conference โ€” I'm always up for a good conversation about GPU scheduling.

Get insights on AI infrastructure, GPU platforms, and cloud native engineering

๐Ÿ“ฌ Subscribe to the Newsletter

Free 30-min AI & Cloud consultation

Book Now