Skip to main content
๐ŸŽค Speaking at KubeCon EU 2026 Lessons Learned Orchestrating Multi-Tenant GPUs on OpenShift AI View Session
๐ŸŽค Speaking at Red Hat Summit 2026 GPUs take flight: Safety-first multi-tenant Platform Engineering with NVIDIA and OpenShift AI Learn More
Future of DevOps with AI agents
DevOps

The Future of DevOps: How AI Agents Are Reshaping Operations

AI agents are transforming DevOps practices. From automated incident response to AI-generated infrastructure code, here is what the future looks like.

LB
Luca Berton
ยท 2 min read

DevOps as we knew it is being restructured by AI agents. Not replaced โ€” restructured. The humans are still essential, but their role is shifting from executing procedures to supervising intelligent systems.

What Changed in 2025-2026

Three developments converged:

  1. AI agents that can execute โ€” not just suggest, but run commands, interpret output, and iterate. Tools like OpenClaw give agents real infrastructure access.
  2. Context-aware models โ€” GPT-5, Claude 4, and open-source alternatives understand infrastructure context well enough to make reasonable operational decisions.
  3. Standardized tool interfaces โ€” MCP (Model Context Protocol) gives agents structured access to monitoring, ticketing, and deployment systems.

The New DevOps Stack

Traditional: Human โ†’ Runbook โ†’ Terminal โ†’ Infrastructure

Emerging: Human โ†’ AI Agent โ†’ Tools โ†’ Infrastructure โ†’ Human (review)

The agent handles the routine. The human handles the judgment. This is not theoretical โ€” I am building these systems at client sites right now.

Practical AI Agent Patterns

Incident triage โ€” Agent receives alert, queries monitoring systems, correlates with recent deployments, drafts incident summary, suggests remediation. Human approves or modifies.

# Example: OpenClaw skill for incident triage
triggers:
  - pagerduty_alert
  - alertmanager_webhook

steps:
  - query_prometheus: "Get metrics for affected service"
  - query_git_log: "Recent deployments in last 2 hours"
  - correlate: "Match alert timing with deployment"
  - draft_response: "Summarize findings and suggest action"
  - notify_human: "Send to on-call for approval"

Deployment validation โ€” Agent runs post-deployment checks, compares metrics against baseline, flags regressions, can auto-rollback for clear failures.

Documentation generation โ€” Agent reads infrastructure changes, updates runbooks, generates architecture diagrams. This is where agents excel โ€” tedious work that humans skip.

What AI Agents Cannot Do Yet

  • Make business decisions โ€” โ€œShould we migrate to a new cloud provider?โ€ requires business context agents lack
  • Handle novel failures โ€” agents pattern-match against known scenarios; genuinely new failure modes need human creativity
  • Navigate organizational politics โ€” infrastructure decisions involve people, budgets, and priorities that are invisible to agents

Building Trust Incrementally

Start with read-only agents. Let them observe and suggest. Once your team trusts their suggestions, grant write access with human approval. Then expand to auto-remediation for well-understood scenarios.

The platform engineering teams I advise follow this progression over 3-6 months.

The Role of the DevOps Engineer in 2027

You are becoming an AI supervisor. Your value is not in running kubectl commands โ€” it is in designing the systems that agents operate, setting the guardrails, reviewing the edge cases, and handling the situations that require judgment.

Learn to write agent skills and workflows. Understand AI agent architecture patterns. The engineers who adapt will be dramatically more productive. Those who resist will find their routine tasks automated regardless.

Luca Berton Ansible Pilot Ansible by Example Open Empower K8s Recipes Terraform Pilot CopyPasteLearn ProteinLens TechMeOut