
Building a Private AI Cloud with RHEL AI and InstructLab

Luca Berton • 1 min read
#rhel-ai#instructlab#private-cloud#ai#enterprise

🏢 Your Own AI Cloud

For regulated industries such as healthcare, finance, defense, and government, sending data to OpenAI or Anthropic isn't an option. RHEL AI and InstructLab let you build a fully private AI cloud, with custom models trained on your own data.

Architecture

RHEL AI (Base Platform)
  ├── InstructLab (Model Customization)
  │   ├── Taxonomy-based training
  │   └── Synthetic data generation
  ├── vLLM (Model Serving)
  ├── Granite Models (Base Foundation)
  └── GPU Management (NVIDIA drivers + container toolkit)

Installation

# RHEL AI bootable container image
sudo bootc switch registry.redhat.io/rhel-ai/rhel-ai-nvidia:1.4

# Initialize InstructLab
ilab config init
ilab model download --repository instructlab/granite-7b-lab

# Serve the base model
ilab model serve

Custom Model Training

1. Define Your Knowledge

Create taxonomy entries for your domain:

# taxonomy/knowledge/company/policies/qna.yaml
created_by: platform-team
domain: company_policies
seed_examples:
  - question: What is the data retention policy?
    answer: |
      Data must be retained for 7 years for financial records,
      3 years for operational data, and deleted within 30 days
      upon customer request per GDPR Article 17.
  - question: What are the approved cloud regions?
    answer: |
      Production workloads must run in EU-West-1 (Ireland) or
      EU-Central-1 (Frankfurt). US regions require CISO approval.
document:
  repo: https://gitlab.internal/policies
  commit: abc123
  patterns:
    - "*.md"
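A malformed taxonomy entry is the most common cause of a failed generation run, so it can be worth sanity-checking the shape of each entry first. Below is a minimal sketch of such a check; it only mirrors the fields used in the example above and is not the official InstructLab schema (`ilab` performs its own validation during generation):

```python
# Minimal sanity check for a qna.yaml-style taxonomy entry.
# NOTE: mirrors the fields in the example above; not the official
# InstructLab schema -- ilab validates authoritatively at generate time.

def validate_entry(entry: dict) -> list[str]:
    """Return a list of problems found in a taxonomy entry dict."""
    problems = []
    for key in ("created_by", "domain", "seed_examples", "document"):
        if key not in entry:
            problems.append(f"missing top-level key: {key}")
    for i, ex in enumerate(entry.get("seed_examples", [])):
        for field in ("question", "answer"):
            if not ex.get(field, "").strip():
                problems.append(f"seed_examples[{i}]: empty {field}")
    return problems

entry = {
    "created_by": "platform-team",
    "domain": "company_policies",
    "seed_examples": [
        {"question": "What is the data retention policy?",
         "answer": "7 years for financial records..."},
        {"question": "What are the approved cloud regions?",
         "answer": ""},  # deliberately broken to show detection
    ],
    "document": {"repo": "https://gitlab.internal/policies"},
}

print(validate_entry(entry))  # → ['seed_examples[1]: empty answer']
```

Running this before `ilab data generate` catches empty answers and missing keys early, instead of partway through a long synthetic-data run.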

2. Generate Synthetic Training Data

ilab data generate \
  --taxonomy-path ./taxonomy \
  --num-instructions 1000 \
  --model granite-7b-lab

InstructLab generates diverse question-answer pairs from your seed examples, multiplying roughly 10 examples into 1,000+ training samples.

3. Train

ilab model train \
  --model-path models/granite-7b-lab \
  --data-path generated_data \
  --num-epochs 5 \
  --effective-batch-size 16 \
  --device cuda

4. Evaluate and Deploy

# Test the model
ilab model evaluate --model models/granite-7b-trained

# Serve in production
ilab model serve \
  --model-path models/granite-7b-trained \
  --host 0.0.0.0 \
  --port 8000
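The served model speaks an OpenAI-compatible API, so any standard client can query it. Here is a stdlib-only sketch; the base URL matches the `--host`/`--port` values above, and the model name `granite-7b-trained` is an assumption that should match whatever name the server registers:

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000/v1"  # matches the serve command above

def build_payload(prompt: str, model: str = "granite-7b-trained") -> dict:
    """Assemble an OpenAI-style chat completion request body."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,  # keep policy answers close to the training data
    }

def extract_answer(response: dict) -> str:
    """Pull the assistant text out of an OpenAI-style response."""
    return response["choices"][0]["message"]["content"]

def ask(prompt: str) -> str:
    """POST to the live endpoint; requires `ilab model serve` to be running."""
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(build_payload(prompt)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return extract_answer(json.load(resp))

# With the server running:
#   print(ask("What is the data retention policy?"))
```

Because the API shape is OpenAI-compatible, existing tooling (the `openai` Python client, LangChain, etc.) can point at this endpoint by overriding the base URL.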

Production Deployment on Kubernetes

apiVersion: apps/v1
kind: Deployment
metadata:
  name: private-llm
  namespace: ai-platform
spec:
  replicas: 2
  selector:
    matchLabels:
      app: private-llm
  template:
    metadata:
      labels:
        app: private-llm
    spec:
      containers:
      - name: vllm
        image: registry.internal/vllm:latest
        args:
        - "--model=/models/granite-7b-company"
        - "--max-model-len=8192"
        resources:
          limits:
            nvidia.com/gpu: "1"
        volumeMounts:
        - name: models
          mountPath: /models
      volumes:
      - name: models
        persistentVolumeClaim:
          claimName: model-storage

Why Private AI?

  • Data sovereignty: Your data never leaves your infrastructure
  • Compliance: Meet GDPR, HIPAA, SOX requirements by design
  • Customization: Models fine-tuned on your domain knowledge can outperform general-purpose models on your tasks
  • Cost predictability: No per-token API costs, just infrastructure
  • Availability: No dependency on external API providers
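To make the cost-predictability point concrete, here is a back-of-the-envelope comparison. Every figure below is an illustrative assumption, not vendor pricing; the point is that API spend scales with token volume while self-hosted spend stays flat:

```python
# Back-of-the-envelope monthly cost comparison.
# All figures are illustrative assumptions, not vendor pricing.

tokens_per_month = 500_000_000        # assumed workload: 500M tokens/month
api_price_per_1k_tokens = 0.01        # assumed blended API price (USD)
gpu_node_monthly_cost = 3_000.0       # assumed one self-hosted GPU node (USD)

api_cost = tokens_per_month / 1_000 * api_price_per_1k_tokens
private_cost = gpu_node_monthly_cost  # flat, regardless of token volume

print(f"API cost:     ${api_cost:,.0f}/month")      # scales with usage
print(f"Private cost: ${private_cost:,.0f}/month")  # flat
```

At these assumed numbers the API bill is $5,000/month and grows linearly with traffic, while the private node stays at $3,000/month; double the token volume and only the API column changes.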

Building a private AI cloud? I help organizations deploy RHEL AI and InstructLab for custom AI platforms. Get in touch.


Luca Berton

AI & Cloud Advisor with 18+ years experience. Author of 8 technical books, creator of Ansible Pilot, and instructor at CopyPasteLearn Academy. Speaker at KubeCon EU & Red Hat Summit 2026.
