Fusion HCI Performance Boost for AI Apps


Fine-tuning and inferencing with foundation models used in AI applications increasingly requires faster and more powerful hardware as AI foundation models grow ever larger.  In response to this need, IBM Storage Fusion HCI System released a new GPU server option earlier this month called the G02 that gives a performance boost for AI applications.

The new G02 GPU server is based on the Lenovo SR650 V3.  The configuration used in the G02 includes a pair of Intel Xeon Gold 6418H 24-core "Sapphire Rapids" processors.  This processor model includes Intel Deep Learning Boost, designed to accelerate AI use cases by extending the Intel AVX-512 instruction extension with a new instruction that increases deep learning inference performance.

But the big boost provided by the G02 for AI apps comes from the three NVIDIA A100 80GB GPU PCIe adapter cards in each G02 server.  This newer version of the A100 has up to 1.25x higher AI inference performance over the A100 40GB that was used in the original Fusion HCI GPU server, the G01.  And as the name suggests, these newer A100 GPU adapters also have twice as much RAM.  This larger amount of RAM is critical for loading larger foundation models that need to be loaded in their entirety to get the best performance.

In addition to these significant GPU and CPU resources, there is 512GB of RAM configured as 16x 32GB DIMMs.  That works out to be one DIMM in every memory channel for maximum memory performance.  The CoreOS operating system used by Red Hat OpenShift is installed on two 960GB M.2 NVMe OS drives in a RAID 1 configuration for redundancy.  Local storage is provided by a pair of 2.5 inch Samsung PM1655 3.2TB SAS 24Gb SSDs that also use RAID 1 mirroring for redundancy.

Like every other server in a Fusion HCI rack, the G02 server has redundant network connections to the OpenShift application network. These are provided by an NVIDIA ConnectX-6 LX dual-port 25GbE PCIe NIC. And the G02 also has redundant connections to the high-speed storage network using an NVIDIA ConnectX-6 DX dual-port 100GbE PCIe NIC.

Designed to boost the performance of AI applications with powerful GPU and CPU technology, the new Fusion HCI G02 GPU server is the ideal option to add to an IBM Storage Fusion HCI System!

To learn more about how Fusion HCI uses its GPU resources like the G02 in support of AI workloads, check out this page: https://ibm.github.io/storage-fusion/watsonx/overview

Previous post: A New Direction for IBM Storage




The opinions expressed in this post are those of the author.

Comments

Popular posts from this blog

Inside the Storage/Compute Servers of IBM Spectrum Fusion HCI

Fusion HCI Adds 8x the GPU Power