Oracle AI World 2025: Oracle Unveils Next-Generation Oracle Cloud Infrastructure Zettascale10 Cluster for AI
World's largest cloud AI supercomputer delivers 10× zettaFLOPS performance using Nvidia and Oracle Acceleron architecture
This is a Press Release edited by StorageNewsletter.com on October 16, 2025 at 2:02 pmSummary:
- Largest AI supercomputer in the cloud delivers 10X the amount of zettaFLOPS of peak performance
- Built on Oracle Acceleron RoCE networking architecture with NVIDIA AI infrastructure, OCI Zettascale10 will provide multi‑gigawatt AI workload capacity and scale
Oracle Corp. announced Oracle Cloud Infrastructure (OCI) Zettascale10, the largest AI HPC in the cloud. OCI Zettascale10 connects hundreds of thousands of Nvidia GPUs across multiple data centers to form multi-gigawatt clusters that deliver up to an unprecedented 16 zettaFLOPS of peak performance. OCI Zettascale10 is the fabric underpinning the flagship supercluster built in collaboration with OpenAI in Abilene, TX, as part of Stargate. Built on next-gen Oracle Acceleron RoCE networking architecture, OCI Zettascale10 is powered by Nvidia AI infrastructure that delivers breakthrough scale, extremely low GPU-GPU latency across the cluster, industry-leading price-performance, improved cluster utilization, and the reliability required for large scale AI workloads.OCI Zettascale10 is a powerful evolution of the first Zettascale cloud computing cluster, which was introduced in September 2024. OCI Zettascale10 clusters are housed in large gigawatt data center campuses that are hyper-optimized for density within a two-kilometer radius to offer the best GPU-GPU latency for large scale AI training workloads. This architecture is being deployed with OpenAI at the Stargate site in Abilene.
“With OCI Zettascale10, we’re fusing OCI’s groundbreaking Oracle Acceleron RoCE network architecture with next-gen Nvidia AI infrastructure to deliver multi‑gigawatt AI capacity at unmatched scale,” said Mahesh Thiagarajan, EVP, Oracle Cloud Infrastructure. “Customers can build, train, and deploy their largest AI models into production using less power per unit of performance and achieving high reliability. In addition, customers will have the freedom to operate across Oracle’s distributed cloud with strong data and AI sovereignty controls.”
“OCI Zettascale10 network and cluster fabric was developed and deployed first at the flagship Stargate site in Abilene, TX – our joint supercluster with Oracle,” said Peter Hoeschele, VP, infrastructure and industrial compute, OpenAI. “The highly scalable custom RoCE design maximizes fabric-wide performance at gigawatt scale while keeping most of the power focused on compute. We’re excited to keep scaling Abilene and the broader Stargate program together.”
OCI plans to offer multi-gigawatt deployments of OCI Zettascale10 to customers. Initially, OCI Zettascale10 clusters will target deployments of up to 800,000 Nvidia GPUs delivering predictable performance and strong cost efficiency, with high GPU‑to‑GPU bandwidth enabled by Oracle Acceleron’s ultra‑low‑latency RoCEv2 networking.
“Oracle and Nvidia are bringing together OCI’s distributed cloud and our full‑stack AI infrastructure to deliver AI at extraordinary scale,” said Ian Buck, VP, Hyperscale, Nvidia. “Featuring NVIDIA full-stack AI infrastructure, OCI Zettascale10 provides the compute fabric needed to advance state‑of‑the‑art AI research and help organizations everywhere move from experimentation to industrialized AI.”
Oracle Acceleron RoCE networking delivers scale, reliability, and efficiency for AI on OCI Zettascale10
Oracle Acceleron RoCE networking architecture is a critical innovation for customers to build, train, and inference AI workloads in the cloud, while taking full advantage of OCI Zettascale10’s power and capabilities. It uses the switching capability built into modern GPU NICs (network interface cards), allowing them to connect to multiple switches simultaneously, with each on a separate and isolated network plane. This approach dramatically increases the network’s overall scale and reliability by shifting traffic to other network planes when one has a problem, avoiding costly stalls and restarts. Key features of Oracle Acceleron RoCE networking that help customers with their critical AI workloads, include:
- Wide, shallow, resilient fabric: Helps customers deploy larger AI clusters faster at lower total cost by using the GPU NIC as a mini‑switch and connecting to multiple physically and logically isolated planes. This boosts scale while reducing network tiers, cost, and power.
- Higher reliability: Helps customers maintain the stability of AI jobs by eliminating data sharing across planes. This shifts traffic away from unstable or congested planes, which keeps training jobs running and avoids costly checkpoint restarts.
- Consistent performance: Provides customers with more uniform GPU‑to‑GPU latency by removing a tier vs. traditional three-tier designs, improving predictability for large‑scale AI training and inference.
- Power‑efficient optics: Supports customer workloads with Linear Pluggable Optics (LPO) and Linear Receiver Optics (LRO) to cut network and cooling costs without sacrificing 400G/800G throughput. This allows customers to devote more of their power budget to compute.
- Operational flexibility: Helps customers reduce downtime and speed up feature rollouts through plane‑level maintenance and independent network operating system updates.
OCI is now taking orders for OCI Zettascale10, which will be available in the second half of next calendar year, with up to 800,000 Nvidia AI infrastructure GPU platforms.
Comments
This news confirms that Oracle belongs to the top hyperscalers. For many years, all of us considered 3 hyperscalers - AWS, Azure and Google Cloud - and it is especially true as Larry Ellison, CTO and founder, refused the cloud wave as it impacts its on-prem oracle database installed base and business. Long story short, today he promotes cloud as Oracle has made big progress with OCI - Oracle Cloud Infrastructure - to finally faced the reality. And as AI changed the world, Ellison, a recognized remarkable business man, changed Oracle's approach and Oracle joined this small hyperscaler club.
This is confirmed with the US program Stargate and several big deals signed with OpenAI especially this one at Abilene, TX.
Click to enlarge
OCI Zettascale10 introduced super large AI data centers by numbers, 16 zettaFLOPS with up to 800,000 Nvidia GPUs with high GPU‑to‑GPU bandwidth fuled by Oracle Acceleron's ultra‑low‑latency RoCEv2 networking based on Nvidia Spectrum-X Ethernet coupling switchs and SuperNICs. We saw several storage players announcing the availability of their AI oriented offerings on OCI.