Supermicro introduces a rack scale total solution for AI Storage

Supermicro is launching a full stack optimised storage solution for AI and ML data pipelines from data collection to high performance data delivery.

  • Monday, 29th January 2024 Posted 1 year ago in by Phil Alsop

This new solution maximizes AI time-to-value by keeping GPU data pipelines fully saturated. For AI training, massive amounts of raw data at petascale capacities can be collected, transformed, and loaded into an organisation's AI workflow pipeline. This multi-tiered Supermicro solution has been proven to deliver multi-petabyte data for AIOps and MLOps in production environments. The entire multi-rack scale solution from Supermicro is designed to reduce implementation risks, enable organisations to train models faster, and quickly use the resulting data for AI inference.

“With 20 PB per rack of high-performance flash storage driving four application-optimised NVIDIA HGX H100 8-GPU based air-cooled servers or eight NVIDIA HGX H100 8-GPU based liquid-cooled servers, customers can accelerate their AI and ML applications running at rack scale,” said Charles Liang, president and CEO of Supermicro. “This solution can deliver 270 GB/s of read throughput and 3.9 million IOPS per storage cluster as a minimum deployment and can easily scale up to hundreds of petabytes. Using the latest Supermicro systems with PCIe 5.0 and E3.S storage devices and WEKA Data Platform software, users will see significant increases in the performance of AI applications with this field-tested rack scale solution. Our new storage solution for AI training enables customers to maximize the usage of our most advanced rack scale solutions of GPU servers, reducing their TCO and increasing AI performance.”

Petabytes of unstructured data used in large-scale AI training processing must be available to the GPU servers with low latencies and high bandwidth to keep the GPUs productive. Supermicro’s extensive portfolio of Intel and AMD based storage servers is a crucial element of the AI pipeline. These include the Supermicro Petascale All-Flash storage servers, which have a capacity of 983.04* TB per server of NVMe Gen 5 flash capacity and deliver up to 230 GB/s of read bandwidth and 30 million IOPS. This solution also includes the Supermicro SuperServer 90 drive bay storage servers for the capacity object tier. This complete and tested solution is available worldwide for customers in ML, GenAI, and other computationally complex workloads.

The new storage solution consists of:

All-Flash tier - Supermicro Petascale Storage Servers

Application tier – Supermicro 8U GPU Servers: AS -8125GS-TNHR and SYS-821GE-TNHR

Object tier - Supermicro 90 drive bay 4U SuperStorage Server running Quantum ActiveScale object storage

Software: WEKA Data Platform and Quantum ActiveScale object storage

Switches: Supermicro InfiniBand and Ethernet Switches

“The high performance and large flash capacity of Supermicro’s All-Flash Petascale Storage Servers perfectly complement WEKA’s AI-native data platform software. Together, they provide the unparalleled speed, scale, and simplicity demanded by today’s enterprise AI customers,” said Jonathan Martin, president at WEKA.

Assured Data Protection announces organisational changes to strengthen its growth trajectory, appointing Stacy Hayes as Chief Strategy Officer and...
Calero introduces a new SaaS Management offering to streamline IT processes, optimise resources, and centralise data for today's technology-driven...

Parallel Works launches ACTIVATE AI Partner Ecosystem

Posted 20 hours ago by Aaron Sandhu
Parallel Works introduces its ACTIVATE AI Partner Ecosystem, enhancing AI infrastructure with scalable, integrated solutions across hybrid cloud...

Zurich Insurance Group acquires BOXX Insurance Inc.

Posted 2 days ago by Aaron Sandhu
BOXX Insurance is set to join Zurich Insurance Group, continuing its mission in cyber insurance and protection as an independent entity.
CISPE appeals Broadcom's VMware acquisition approval, citing competition risks and exclusion of smaller providers.
Discover inforcer’s journey to simplify AI and security adoption for MSPs with its recent $35 million Series B funding.
A groundbreaking study redefines RAID rebuild operations, showcasing Xinnor's xiRAID software's remarkable performance and energy efficiency on...
Ekinops partners with Telegraph42 to boost high-speed data transmission in Eurasia using advanced wavelength technology.