NVIDIA Unveils Vera Rubin Platform for Next-Generation Agentic AI Factories

NVIDIA  announced the NVIDIA Vera Rubin platform is ramping into full production to power agentic AI factories worldwide. Taiwan’s top server makers and global supply ......

NVIDIA Unveils Vera Rubin Platform for Next-Generation Agentic AI Factories

NVIDIA  announced the NVIDIA Vera Rubin platform is ramping into full production to power agentic AI factories worldwide.

Taiwan’s top server makers and global supply chain leaders are manufacturing Vera Rubin-based systems at scale — fueling AI labs, cloud providers and hyperscalers to build tomorrow’s intelligence.

Vera Rubin delivers NVIDIA’s most extensive POD-scale platform — five purpose-built racks operating as one massive AI supercomputer for agentic workloads. The platform unifies NVIDIA Vera Rubin NVL72 systems, NVIDIA Vera CPU, NVIDIA Groq 3 LPX, NVIDIA Vera BlueField-4 STX storage and NVIDIA Spectrum™-6 SPX Ethernet racks into a fully integrated system. Vera Rubin delivers 10x agent throughput at scale compared with the previous-generation NVIDIA Grace Blackwell platform.

“Agentic AI is a new kind of workload. One prompt can launch a thousand-step journey of reasoning, retrieval, tool use and response generation. Vera Rubin was built for this moment — an AI factory engine that delivers intelligence at scale, with the performance, efficiency and security needed to power the next industrial revolution.”

Jensen Huang, founder and CEO, NVIDIA

 

Vera Rubin Ramp

Vera Rubin marks the third generation of NVIDIA MGX™ rack-scale systems. With a proven, open source MGX design, hundreds of NVIDIA supply chain ecosystem partners — 150 in Taiwan alone — across 350+ factories and 30 countries are ramping Vera Rubin.

Top system builders, infrastructure software and storage partners are in full-scale production of Vera Rubin. This includes Dell Technologies, HPE, Lenovo and Supermicro, as well as AIC, Aivres, ASRock Rack, ASUS, Cloudian, Compal, DDN, Everpure, Foxconn, GIGABYTE, Hitachi Vantara, Hyve Solutions, IBM, Inventec, MinIO, MiTAC Computing, MSI, NetApp, Nutanix, Pegatron, Quanta Cloud Technology (QCT), VAST Data, WEKA, Wistron and Wiwynn.

 

Building the Fabric for Million-GPU AI Factories

To support scale-out and scale-across AI factory deployments, the Vera Rubin platform introduces NVIDIA Spectrum-X™ Ethernet Photonics, the world’s first co-packaged-optics (CPO)-based switches with 200Gb/s SerDes — now in production.

Spectrum-X Ethernet Photonics, a new generation of switching technology built on CPO, delivers 5x better power efficiency, 5x longer AI uptime and 1.3x faster time to deployment than networks using traditional transceivers.

By simplifying design and freeing more power for compute, NVIDIA co-packaged optics networking provides the foundational fabric for million-GPU AI factories, with CoreWeave, Lambda and Oracle Cloud Infrastructure among the first ecosystem partners and adopters.

The NVIDIA Vera Rubin platform also integrates NVIDIA BlueField®-4 DPUs, featuring software-defined networking at speeds of up to 800Gb/s and built-in multi-tenant isolation. With the NVIDIA BlueField-4 Advanced Secure Trusted Resource Architecture, customers can simplify network operations, improve tenant isolation and gain greater control across million GPU AI clusters.

 

Secure AI for AI Factories

AI factories are increasingly processing proprietary data, regulated content and mission-critical models in agentic workflows. This requires advanced infrastructure security tailored to autonomous agents in shared or cloud environments where infrastructure cannot be implicitly trusted.

The Vera Rubin platform was designed with full-stack NVIDIA Confidential Computing for a trusted execution environment at rack scale. Vera Rubin NVL72 combines Vera CPUs, Rubin GPUs, NVIDIA NVLink™ networking and security features into a unified platform, encrypting data across high-speed interconnects. This provides hardware-level attestation to ensure the system is tamper-proof.

Cloud providers CoreWeave, Firmus, GMI Cloud, IBM Cloud, IREN, Lambda, Microsoft Azure, Nebius, Nscale, SpaceXAI and Vultr are adopting NVIDIA Confidential Computing.

Delivering this level of protection at POD scale also requires a programmable software layer capable of enforcing, orchestrating and adapting security policies across the entire system. The NVIDIA DOCA™ software platform delivers advanced security across every Vera Rubin platform rack and layer of the AI factory — protecting data, agents, context memory and AI inference through capabilities enforced directly in BlueField-4 silicon.

DOCA enables multi-tenant network isolation, zero-trust policy enforcement, runtime threat detection and end-to-end encryption at speeds of up to 800Gb/s, all without taxing host CPU resources, so enterprises can scale AI factories with confidence.

 

Accelerating the Buildout of AI Factories

The NVIDIA DSX™ platform provides the complete design and operational foundation for Vera Rubin AI factories — unifying reference designs, simulation, infrastructure software, facilities and ecosystem technologies to help build and operate energy-efficient AI factories optimized for lowest token cost.

Built for the Vera Rubin POD architecture, DSX aligns every layer of the stack — from silicon and systems to lifecycle management and multi-tenant operations — dramatically accelerating deployment and setting a new bar for operational reliability and resiliency at scale.

Dell Technologies, HPE, Lenovo and Supermicro together with ASUS, Foxconn, GIGABYTE, Pegatron, Quanta Cloud Technology (QCT), Wistron and Wiwynn are adopting NVIDIA DSX to accelerate AI factory ramp with Vera Rubin.

 

Availability

Production shipments of Vera Rubin are set to begin starting this fall.