Building Digital Sovereign AI: Red Hat and MetaX at Red

At Red Hat Summit 2026 in Atlanta, one of the most eye-opening sessions I attended was at Discovery Theater 3: “Building Digital Sovereign AI” — presented by Li Ming Tsai (Red Hat AI Architect) and Jiaju Zhang (Red Hat Chief Architect, Greater China Group).

The session introduced a sovereign, cloud-native AI infrastructure built with Red Hat OpenShift AI and MetaX — a Shanghai-based GPU company offering high-performance, cost-efficient alternatives to NVIDIA. Together with Red Hat AI Inference Server and MetaX MIM (MetaX Inference Microservice), they are redefining AI infrastructure with an open, sovereign approach.

Speaker presenting How is Asia shaping AI today at Discovery Theater 3

Three Pillars: Models, Chips, Supply Chain

The talk framed Asia’s AI influence across three dimensions:

Models: The APAC LLM Rise

Asia is no longer just consuming AI — it is producing frontier models at scale. The APAC LLM Rise is real:

Solar (Upstage) — South Korea’s enterprise LLM pushing multilingual capabilities
HyperCLOVA X — NAVER’s Korean-first foundation model
ERNIE 4.0 (Baidu) — China’s answer to GPT-4, deeply integrated into Baidu’s ecosystem
Qwen (Alibaba) — open-weight models competing with Meta’s Llama
DeepSeek — cost-efficient reasoning models that shocked the industry
Multiple multilingual and enterprise-focused models across Japan, India, and Southeast Asia

The pattern is clear: APAC is not waiting for Silicon Valley. They are building their own foundation model stacks, optimized for local languages, regulations, and enterprise needs.

Wide shot of Discovery Theater 3 — packed audience with headphones at Red Hat Summit

AI Chips: Beyond NVIDIA

The second pillar — AI Chips — showed Asia’s semiconductor ambitions going far beyond manufacturing for Western companies:

Huawei Ascend — the most visible challenger to NVIDIA’s data center GPUs, powering China’s sovereign AI infrastructure
TSMC — still the world’s most advanced chip fabricator, producing silicon for NVIDIA, AMD, Apple, and Qualcomm
Samsung — HBM (High Bandwidth Memory) production critical for every AI accelerator
Cambricon — Chinese AI chip startup with custom NPUs
Tenstorrent — Jim Keller’s RISC-V based AI accelerator, with significant Asian investment

The message: the AI chip supply chain has always run through Asia. Now Asia is designing the chips too, not just manufacturing them.

AI Supply Chain: The Hidden Leverage

The third dimension — perhaps the most strategic — is the AI supply chain itself:

Rare earth minerals — China controls approximately 60% of global rare earth mining and 90% of processing
Advanced packaging — TSMC’s CoWoS packaging is the bottleneck for every H100/B200 shipment
Memory — Samsung and SK Hynix dominate HBM production
Assembly and testing — concentrated in Taiwan, South Korea, Malaysia, and China

Any disruption in this supply chain ripples through the entire global AI infrastructure within weeks.

Close-up of the speaker discussing APAC LLM models and AI chips

Why This Matters for Enterprise AI

If you are building an enterprise AI strategy, ignoring Asia’s influence is a strategic blind spot:

Model diversity — APAC models offer alternatives to US-centric foundation models, especially for multilingual deployments and data sovereignty requirements
Hardware diversification — relying solely on NVIDIA means relying on Asian supply chains anyway; understanding the full stack matters
Regulatory landscape — China’s AI regulations, Japan’s copyright-friendly training rules, and India’s open-source AI push all create different deployment realities
Cost competition — DeepSeek proved that frontier-competitive models can be built at a fraction of the cost, pressuring the entire industry

Discovery Theater 3 wide shot — two speakers presenting with QR code for lightning talks

China’s Strategic Shift to Sovereign AI

The talk then zoomed into China AI Market Analysis (2026) — China’s deliberate pivot to sovereign AI independence:

China's strategic shift to Sovereign AI — market analysis 2026

Market Size and Growth:

Valued at approximately $28-32 Billion in 2025/2026, with CAGR exceeding 32%
Services (56%) and Software (49%) have overtaken hardware — the focus is on deploying models, not just buying GPUs

The “Sovereign Stack” Driver:

Massive investment in AI chip design, manufacture, and ecosystem creation
China is building a parallel “Eastern Tech” stack for business continuity — complete independence from Western technology

MaaS Explosion (Model-as-a-Service):

China leads in model efficiency
Domestic open-source models (Qwen, DeepSeek) account for nearly 30% of global token usage

Key Trend — Agentic Workflows:

The market has moved past “chat” into Autonomous Agents for manufacturing, fintech, robotics/physical AI, and “Smart City” infrastructure
A key government growth driver
OpenClaw and Hermes are very popular in China

Discovery Theater 3 — wide shot of China Sovereign AI slide with packed audience

Close-up of China's strategic shift to Sovereign AI slide

Speaker presenting China AI market data with headphone audience

The Sovereign AI Technology Stack

The most impressive slide — a complete Sovereign AI Technology Stack built entirely from Chinese and open-source components:

The Sovereign AI Technology Stack — full stack from hardware to applications

Layer	Category	Components
Applications	Application Layer	Dify, Agents: OpenClaw, Hermes
Models	Intelligence Layer	DeepSeek, Qwen, GLM, Kimi, MiniMax, MiMo, HunYuan, etc.
Inference Engine	Runtime Execution Layer	vLLM, SGLang, xLLM, MindIE, LMDeploy, chitu, etc.
Cache Management	Efficiency Layer	Mooncake
Accelerator	Accelerated Computing Layer	Ascend NPU, Alibaba T-Head, Baidu Kunlunxin, MetaX GPU, etc.
Hardware	Physical Infrastructure Layer	SuperPod: CloudMatrix 384, Shanghai Cube, etc.

This is a complete, production-ready alternative to the NVIDIA/CUDA/Western cloud stack. Every layer has Chinese-developed alternatives — from Ascend NPUs replacing NVIDIA GPUs to Mooncake for KV-cache management (replacing vLLM’s native caching) to DeepSeek/Qwen replacing GPT-4/Claude at the model layer.

The Open Blueprint for AI Sovereignty

The session concluded with Red Hat’s positioning: the open blueprint for AI sovereignty, delivered through the Red Hat AI platform:

The open blueprint for AI sovereignty — hybrid cloud control, security and trust, cost-efficient inferencing

Three pillars:

Hybrid cloud control — freedom from vendor lock-in, run AI workloads anywhere, not tied to one stack
Security and trust — transparency and auditability to inspect code and meet top security and regulatory standards in AI
Cost-efficient inferencing — run any model at scale with an open, modular inference stack for high-performance, cost-efficient AI

Red Hat AI Enterprise — complete end-to-end AI platform for predictive ML and generative AI

The Red Hat AI Enterprise slide showed the complete platform spanning experimentation and model development, training and tuning, model serving, registry and governance, and foundation and ops — with integration points for NVIDIA NIM, vLLM, InstructLab, Caikit, and the full Red Hat OpenShift stack.

Luca Berton selfie at Discovery Theater 3 — packed audience with headphones watching Asia AI talk

Luca Berton at Discovery Theater 3 — wide angle selfie during How is Asia shaping AI session

The Joint Solution: Red Hat AI Enterprise + MetaX

The core of the session — the complete joint architecture between Red Hat and MetaX:

The Joint Solution between Red Hat and MetaX — full architecture from OpenShift to MetaX GPUs

The stack targets verticals across FSI, Healthcare, Education, Transportation, Entertainment, Energy and more. The architecture layers:

Red Hat OpenShift AI — Agentic App Development at the top
Red Hat AI Inference Server + MIM (MetaX Inference Microservice) — optimized profiles for inference, handling prompts and responses
Model Monitoring and Observability — Metrics, Telemetry, Accelerator Profiles, Logging, Alerting
MetaX GPU Operator + Network Operator + MetaXLink + MIM Operator — the Kubernetes operators that make MetaX GPUs first-class citizens on OpenShift
Red Hat OpenShift foundation — Enterprise Kubernetes, Enterprise Linux, Container and VM Management, DevOps Tooling, Security, Ecosystem

The MetaX hardware is physically present on the slide — their GPU cards sitting alongside the Red Hat AI Enterprise platform. This is not a future roadmap. It is a production-ready, jointly certified stack.

Model Service via MIM: DeepSeek on MetaX

Model Service via MIM — DeepSeek-R1-Distill-Qwen-32B running on MetaX stack

The demo showed MIM DeepSeek-R1-Distill-Qwen-32B — a distilled reasoning model running entirely on the MetaX software stack:

Startserver / List-model-profiles — standard serving commands
Python / Torch — familiar ML frameworks
Infer Backends / Operator Library — MetaX-optimized inference kernels
MacaRT — MetaX’s runtime (their equivalent of CUDA Runtime)
Communication Library — multi-GPU collective communication
MXMACA — MetaX’s CUDA-equivalent programming framework
UMD/KMD — User/Kernel Mode Drivers
Firmware / OS filesystem — down to bare metal

This is the full software stack from OS to model serving — completely independent of NVIDIA CUDA. DeepSeek-R1-Distill-Qwen-32B is a perfect showcase: a Chinese reasoning model running on Chinese GPUs with Chinese inference software.

Li Ming Tsai presenting the Red Hat + MetaX joint solution architecture

Red Hat Value for Sovereign AI

The closing slide crystallized why Red Hat is the natural partner for sovereign AI deployments:

Red Hat Value — Leader, Stability, Ecosystem, Support

Leader — Red Hat is the leader in open source, Enterprise Linux, and Sovereign Cloud
Stability — Mature engineering capability turning open-source projects into enterprise products — stable Linux and OpenShift formed the foundation of AI Cloud
Ecosystem — Broader ecosystem and the mechanism to easily add any other plugins into the existing framework
Support — Enterprise support capability across various verticals

From Co-pilot to Digital Employee: OpenClaw/Hermes in China

The most forward-looking part of the talk — how China is using OpenClaw and Hermes to build enterprise “Digital Employee” systems:

Leverage OpenClaw/Hermes to build Digital Employee System — Gateway, Node, Channel architecture

Moving from “Co-pilot” to “Digital Employee” — the architecture has three components:

Gateway — Central control plane, responsible for global scheduling, permission management, policy distribution, and multi-channel access
Node — Distributed task execution nodes, close to data and computing power, providing an isolated, persistent sandbox environment
Channel — Multi-channel access layer (Feishu, DingTalk, Web), seamlessly embedding AI capabilities into enterprise operating systems

The key insight: “The maintainability of digital employees does not come from a longer Prompt, but from documentation, modularization, and Skill-ification.”

From Model Service to Task Service

From Model Service to Task Service — AI Infra vs Agent Infra comparison

The paradigm shift from AI Infra (Model) to Agent Infra (Task):

AI Infra (Model)	Agent Infra (Task)
Focus on throughput and latency	Focus on task success rate
Focus on inference performance	Focus on memory and planning
Stateless container environment	Persistent workspace
Passive Q&A response	Proactive task execution

“The Agent of the future is not deployed on a single model, but runs on an entire ‘task stack‘“

The Feature Set

Next Step: The feature set — Memory, Tool Use, Task Planning, Runtime/Sandbox, Multi-Agent, Security

The roadmap for enterprise AI agents:

Memory — Four-layer memory architecture (SOUL, TOOLS, USER, Session) to ensure task continuity and personalization
Tool Use — Standardized MCP/Skills interfaces to achieve seamless integration with existing enterprise systems (Jira, GitLab, etc.)
Task Planning — Ability to break down complex goals into executable steps, dynamically correct paths, and perform self-reflection
Runtime / Sandbox — Isolated execution environment, balancing execution efficiency with system security, supporting persistent workspaces
Multi-Agent — Cooperative teaming of digital employees with different responsibilities to handle complex, cross-domain, long-chain business tasks
Security and Identity — Dynamic permission management and full-process behavior auditing to ensure AI behavior compliance and protect enterprise asset security

Key Takeaway

Asia is not just part of the AI story — it is writing the AI story. From foundation models (Qwen, DeepSeek, Solar) to chips (Ascend, Cambricon) to the supply chain that makes all AI hardware possible (TSMC, Samsung HBM, rare earths), the center of gravity is shifting. Enterprise architects who understand this have a strategic advantage.

Great session at Discovery Theater 3 — the kind of perspective that does not get enough airtime at Western conferences.

Community Reactions

Steve Shirkey (MetaX) shared this article on LinkedIn:

“Great recap by Luca Berton on the Red Hat Summit 2026 session on our MetaX collaboration, presented by Li Ming Tsai and Jiaju Zhang.”

It is encouraging to see the MetaX team recognizing this coverage — a sign that the intersection of digital sovereignty, Asian AI ecosystems, and open source infrastructure resonates with the people building it.

Building Digital Sovereign AI: Red Hat and MetaX at Red