Posts

Showing posts from December, 2023

Fusion HCI Performance Boost for AI Apps

Fine-tuning and inference with the foundation models used in AI applications increasingly require faster and more powerful hardware as those models grow ever larger. In response, IBM Storage Fusion HCI System released a new GPU server option earlier this month, the G02, that gives AI applications a performance boost. The G02 is based on the Lenovo SR650 V3, and its configuration includes a pair of Intel Xeon Gold 6418H 24-core "Sapphire Rapids" processors. This processor model includes Intel Deep Learning Boost, which is designed to accelerate AI use cases by extending the Intel AVX-512 instruction set with a new instruction that increases deep learning inference performance. But the big boost for AI apps comes from the three NVIDIA A100 80GB GPU PCIe adapter cards in each G02 server. This newer version of the A100 has up to 1.25x higher AI inference performance over the A100 40GB that was used i