Skip to main content
🎓 Claude Code Masterclass Learn AI-assisted development on Udemy — plus the companion book on Leanpub & Amazon. Start Learning
How is Asia shaping AI today — Discovery Theater 3 at Red Hat Summit 2026
AI

Building Digital Sovereign AI: Red Hat and MetaX at Red

Li Ming Tsai and Jiaju Zhang present China's sovereign AI stack at Red Hat Summit 2026 — MetaX GPUs, Qwen/DeepSeek at 30% global token usage, and the.

LB
Luca Berton
· 9 min read

At Red Hat Summit 2026 in Atlanta, one of the most eye-opening sessions I attended was at Discovery Theater 3: “Building Digital Sovereign AI” — presented by Li Ming Tsai (Red Hat AI Architect) and Jiaju Zhang (Red Hat Chief Architect, Greater China Group).

The session introduced a sovereign, cloud-native AI infrastructure built with Red Hat OpenShift AI and MetaX — a Shanghai-based GPU company offering high-performance, cost-efficient alternatives to NVIDIA. Together with Red Hat AI Inference Server and MetaX MIM (MetaX Inference Microservice), they are redefining AI infrastructure with an open, sovereign approach.

Speaker presenting How is Asia shaping AI today at Discovery Theater 3

Three Pillars: Models, Chips, Supply Chain

The talk framed Asia’s AI influence across three dimensions:

Models: The APAC LLM Rise

Asia is no longer just consuming AI — it is producing frontier models at scale. The APAC LLM Rise is real:

  • Solar (Upstage) — South Korea’s enterprise LLM pushing multilingual capabilities
  • HyperCLOVA X — NAVER’s Korean-first foundation model
  • ERNIE 4.0 (Baidu) — China’s answer to GPT-4, deeply integrated into Baidu’s ecosystem
  • Qwen (Alibaba) — open-weight models competing with Meta’s Llama
  • DeepSeek — cost-efficient reasoning models that shocked the industry
  • Multiple multilingual and enterprise-focused models across Japan, India, and Southeast Asia

The pattern is clear: APAC is not waiting for Silicon Valley. They are building their own foundation model stacks, optimized for local languages, regulations, and enterprise needs.

Wide shot of Discovery Theater 3 — packed audience with headphones at Red Hat Summit

AI Chips: Beyond NVIDIA

The second pillar — AI Chips — showed Asia’s semiconductor ambitions going far beyond manufacturing for Western companies:

  • Huawei Ascend — the most visible challenger to NVIDIA’s data center GPUs, powering China’s sovereign AI infrastructure
  • TSMC — still the world’s most advanced chip fabricator, producing silicon for NVIDIA, AMD, Apple, and Qualcomm
  • Samsung — HBM (High Bandwidth Memory) production critical for every AI accelerator
  • Cambricon — Chinese AI chip startup with custom NPUs
  • Tenstorrent — Jim Keller’s RISC-V based AI accelerator, with significant Asian investment

The message: the AI chip supply chain has always run through Asia. Now Asia is designing the chips too, not just manufacturing them.

AI Supply Chain: The Hidden Leverage

The third dimension — perhaps the most strategic — is the AI supply chain itself:

  • Rare earth minerals — China controls approximately 60% of global rare earth mining and 90% of processing
  • Advanced packaging — TSMC’s CoWoS packaging is the bottleneck for every H100/B200 shipment
  • Memory — Samsung and SK Hynix dominate HBM production
  • Assembly and testing — concentrated in Taiwan, South Korea, Malaysia, and China

Any disruption in this supply chain ripples through the entire global AI infrastructure within weeks.

Close-up of the speaker discussing APAC LLM models and AI chips

Why This Matters for Enterprise AI

If you are building an enterprise AI strategy, ignoring Asia’s influence is a strategic blind spot:

  1. Model diversity — APAC models offer alternatives to US-centric foundation models, especially for multilingual deployments and data sovereignty requirements
  2. Hardware diversification — relying solely on NVIDIA means relying on Asian supply chains anyway; understanding the full stack matters
  3. Regulatory landscape — China’s AI regulations, Japan’s copyright-friendly training rules, and India’s open-source AI push all create different deployment realities
  4. Cost competition — DeepSeek proved that frontier-competitive models can be built at a fraction of the cost, pressuring the entire industry

Discovery Theater 3 wide shot — two speakers presenting with QR code for lightning talks

China’s Strategic Shift to Sovereign AI

The talk then zoomed into China AI Market Analysis (2026) — China’s deliberate pivot to sovereign AI independence:

China's strategic shift to Sovereign AI — market analysis 2026

Market Size and Growth:

  • Valued at approximately $28-32 Billion in 2025/2026, with CAGR exceeding 32%
  • Services (56%) and Software (49%) have overtaken hardware — the focus is on deploying models, not just buying GPUs

The “Sovereign Stack” Driver:

  • Massive investment in AI chip design, manufacture, and ecosystem creation
  • China is building a parallel “Eastern Tech” stack for business continuity — complete independence from Western technology

MaaS Explosion (Model-as-a-Service):

  • China leads in model efficiency
  • Domestic open-source models (Qwen, DeepSeek) account for nearly 30% of global token usage

Key Trend — Agentic Workflows:

  • The market has moved past “chat” into Autonomous Agents for manufacturing, fintech, robotics/physical AI, and “Smart City” infrastructure
  • A key government growth driver
  • OpenClaw and Hermes are very popular in China

Discovery Theater 3 — wide shot of China Sovereign AI slide with packed audience

Close-up of China's strategic shift to Sovereign AI slide

Speaker presenting China AI market data with headphone audience

The Sovereign AI Technology Stack

The most impressive slide — a complete Sovereign AI Technology Stack built entirely from Chinese and open-source components:

The Sovereign AI Technology Stack — full stack from hardware to applications

LayerCategoryComponents
ApplicationsApplication LayerDify, Agents: OpenClaw, Hermes
ModelsIntelligence LayerDeepSeek, Qwen, GLM, Kimi, MiniMax, MiMo, HunYuan, etc.
Inference EngineRuntime Execution LayervLLM, SGLang, xLLM, MindIE, LMDeploy, chitu, etc.
Cache ManagementEfficiency LayerMooncake
AcceleratorAccelerated Computing LayerAscend NPU, Alibaba T-Head, Baidu Kunlunxin, MetaX GPU, etc.
HardwarePhysical Infrastructure LayerSuperPod: CloudMatrix 384, Shanghai Cube, etc.

This is a complete, production-ready alternative to the NVIDIA/CUDA/Western cloud stack. Every layer has Chinese-developed alternatives — from Ascend NPUs replacing NVIDIA GPUs to Mooncake for KV-cache management (replacing vLLM’s native caching) to DeepSeek/Qwen replacing GPT-4/Claude at the model layer.

The Open Blueprint for AI Sovereignty

The session concluded with Red Hat’s positioning: the open blueprint for AI sovereignty, delivered through the Red Hat AI platform:

The open blueprint for AI sovereignty — hybrid cloud control, security and trust, cost-efficient inferencing

Three pillars:

  • Hybrid cloud control — freedom from vendor lock-in, run AI workloads anywhere, not tied to one stack
  • Security and trust — transparency and auditability to inspect code and meet top security and regulatory standards in AI
  • Cost-efficient inferencing — run any model at scale with an open, modular inference stack for high-performance, cost-efficient AI

Red Hat AI Enterprise — complete end-to-end AI platform for predictive ML and generative AI

The Red Hat AI Enterprise slide showed the complete platform spanning experimentation and model development, training and tuning, model serving, registry and governance, and foundation and ops — with integration points for NVIDIA NIM, vLLM, InstructLab, Caikit, and the full Red Hat OpenShift stack.

Luca Berton selfie at Discovery Theater 3 — packed audience with headphones watching Asia AI talk

Luca Berton at Discovery Theater 3 — wide angle selfie during How is Asia shaping AI session

The Joint Solution: Red Hat AI Enterprise + MetaX

The core of the session — the complete joint architecture between Red Hat and MetaX:

The Joint Solution between Red Hat and MetaX — full architecture from OpenShift to MetaX GPUs

The stack targets verticals across FSI, Healthcare, Education, Transportation, Entertainment, Energy and more. The architecture layers:

  • Red Hat OpenShift AI — Agentic App Development at the top
  • Red Hat AI Inference Server + MIM (MetaX Inference Microservice) — optimized profiles for inference, handling prompts and responses
  • Model Monitoring and Observability — Metrics, Telemetry, Accelerator Profiles, Logging, Alerting
  • MetaX GPU Operator + Network Operator + MetaXLink + MIM Operator — the Kubernetes operators that make MetaX GPUs first-class citizens on OpenShift
  • Red Hat OpenShift foundation — Enterprise Kubernetes, Enterprise Linux, Container and VM Management, DevOps Tooling, Security, Ecosystem

The MetaX hardware is physically present on the slide — their GPU cards sitting alongside the Red Hat AI Enterprise platform. This is not a future roadmap. It is a production-ready, jointly certified stack.

Model Service via MIM: DeepSeek on MetaX

Model Service via MIM — DeepSeek-R1-Distill-Qwen-32B running on MetaX stack

The demo showed MIM DeepSeek-R1-Distill-Qwen-32B — a distilled reasoning model running entirely on the MetaX software stack:

  • Startserver / List-model-profiles — standard serving commands
  • Python / Torch — familiar ML frameworks
  • Infer Backends / Operator Library — MetaX-optimized inference kernels
  • MacaRT — MetaX’s runtime (their equivalent of CUDA Runtime)
  • Communication Library — multi-GPU collective communication
  • MXMACA — MetaX’s CUDA-equivalent programming framework
  • UMD/KMD — User/Kernel Mode Drivers
  • Firmware / OS filesystem — down to bare metal

This is the full software stack from OS to model serving — completely independent of NVIDIA CUDA. DeepSeek-R1-Distill-Qwen-32B is a perfect showcase: a Chinese reasoning model running on Chinese GPUs with Chinese inference software.

Li Ming Tsai presenting the Red Hat + MetaX joint solution architecture

Red Hat Value for Sovereign AI

The closing slide crystallized why Red Hat is the natural partner for sovereign AI deployments:

Red Hat Value — Leader, Stability, Ecosystem, Support

  • Leader — Red Hat is the leader in open source, Enterprise Linux, and Sovereign Cloud
  • Stability — Mature engineering capability turning open-source projects into enterprise products — stable Linux and OpenShift formed the foundation of AI Cloud
  • Ecosystem — Broader ecosystem and the mechanism to easily add any other plugins into the existing framework
  • Support — Enterprise support capability across various verticals

From Co-pilot to Digital Employee: OpenClaw/Hermes in China

The most forward-looking part of the talk — how China is using OpenClaw and Hermes to build enterprise “Digital Employee” systems:

Leverage OpenClaw/Hermes to build Digital Employee System — Gateway, Node, Channel architecture

Moving from “Co-pilot” to “Digital Employee” — the architecture has three components:

  • Gateway — Central control plane, responsible for global scheduling, permission management, policy distribution, and multi-channel access
  • Node — Distributed task execution nodes, close to data and computing power, providing an isolated, persistent sandbox environment
  • Channel — Multi-channel access layer (Feishu, DingTalk, Web), seamlessly embedding AI capabilities into enterprise operating systems

The key insight: “The maintainability of digital employees does not come from a longer Prompt, but from documentation, modularization, and Skill-ification.”

From Model Service to Task Service

From Model Service to Task Service — AI Infra vs Agent Infra comparison

The paradigm shift from AI Infra (Model) to Agent Infra (Task):

AI Infra (Model)Agent Infra (Task)
Focus on throughput and latencyFocus on task success rate
Focus on inference performanceFocus on memory and planning
Stateless container environmentPersistent workspace
Passive Q&A responseProactive task execution

“The Agent of the future is not deployed on a single model, but runs on an entire ‘task stack‘“

The Feature Set

Next Step: The feature set — Memory, Tool Use, Task Planning, Runtime/Sandbox, Multi-Agent, Security

The roadmap for enterprise AI agents:

  • Memory — Four-layer memory architecture (SOUL, TOOLS, USER, Session) to ensure task continuity and personalization
  • Tool Use — Standardized MCP/Skills interfaces to achieve seamless integration with existing enterprise systems (Jira, GitLab, etc.)
  • Task Planning — Ability to break down complex goals into executable steps, dynamically correct paths, and perform self-reflection
  • Runtime / Sandbox — Isolated execution environment, balancing execution efficiency with system security, supporting persistent workspaces
  • Multi-Agent — Cooperative teaming of digital employees with different responsibilities to handle complex, cross-domain, long-chain business tasks
  • Security and Identity — Dynamic permission management and full-process behavior auditing to ensure AI behavior compliance and protect enterprise asset security

Key Takeaway

Asia is not just part of the AI story — it is writing the AI story. From foundation models (Qwen, DeepSeek, Solar) to chips (Ascend, Cambricon) to the supply chain that makes all AI hardware possible (TSMC, Samsung HBM, rare earths), the center of gravity is shifting. Enterprise architects who understand this have a strategic advantage.

Great session at Discovery Theater 3 — the kind of perspective that does not get enough airtime at Western conferences.

Community Reactions

Steve Shirkey (MetaX) shared this article on LinkedIn:

“Great recap by Luca Berton on the Red Hat Summit 2026 session on our MetaX collaboration, presented by Li Ming Tsai and Jiaju Zhang.”

It is encouraging to see the MetaX team recognizing this coverage — a sign that the intersection of digital sovereignty, Asian AI ecosystems, and open source infrastructure resonates with the people building it.

Free 30-min AI & Cloud consultation

Book Now