Intel Gaudi 2 - Alternative to NV H100 for Generative AI Performance

Today, MLCommons published results of the industry-standard MLPerf v4.0 benchmark for inference. Intel’s results for Intel Gaudi 2 accelerators and 5th Gen Intel Xeon Scalable processors with Intel Advanced Matrix Extensions (Intel AMX) reinforce the company’s commitment to bring “AI Everywhere” with a broad portfolio of competitive solutions. The Intel Gaudi 2 AI accelerator remains the only benchmarked alternative to Nvidia H100 for generative AI (GenAI) performance and provides strong performance-per-dollar. Further, Intel remains the only server CPU vendor to submit MLPerf results. Intel’s 5th Gen Xeon results improved by an average of 1.42x compared with 4th Gen Intel Xeon processors’ results in MLPerf Inference v3.1.

“We continue to improve AI performance on industry-standard benchmarks across our portfolio of accelerators and CPUs. Today’s results demonstrate that we are delivering AI solutions that deliver to our customers’ dynamic and wide-ranging AI requirements. Both Intel Gaudi and Xeon products provide our customers with options that are ready to deploy and offer strong price-to-performance advantages,” said Zane Ball, Intel corporate vice president and general manager, DCAI Product Management.

Building on its training and inference performance from previous MLPerf rounds, Intel’s MLPerf results provide customers with industry-standard benchmarks to evaluate AI performance.

About the Intel Gaudi 2 Results: The Intel Gaudi software suite continues to increase model coverage of popular large language models (LLMs) and multimodal models. For MLPerf Inference v4.0, Intel submitted Gaudi 2 accelerator results for state-of-the-art models Stable Diffusion XL and Llama v2-70B.

Due to strong customer demand for Hugging Face Text Generation Inference (TGI), Gaudi’s Llama results used the TGI toolkit, which supports continuous batching and tensor parallelism, enhancing the efficiency of real-world LLM scaling. For Llama v2-70B, Gaudi 2 delivered 8035.0 and 6287.5 for offline and server tokens-per-second, respectively. On Stable Diffusion XL, Gaudi 2 delivered 6.26 and 6.25 for offline samples-per-second and server queries-per-second, respectively. With these results, Intel Gaudi 2 continues to offer compelling price/performance, an important consideration when looking at the total cost of ownership (TCO).

About the Intel 5th Gen Xeon Results: Following hardware and software improvements, Intel’s 5th Gen Xeon results improved by a geomean of 1.42x compared with 4th Gen Intel Xeon processors’ results in MLPerf Inference v3.1. As an example, for GPT-J with software optimizations including continuous batching, the 5th Gen Xeon submission showed about 1.8x performance gains compared with the v3.1 submission. Similarly, DLRMv2 showed about 1.8x performance gains and 99.9 accuracy due to MergedEmbeddingBag and other optimizations utilizing Intel AMX.

Intel is proud of its collaboration with OEM partners – Cisco, Dell, Quanta, Supermicro and WiWynn – to deliver their own MLPerf submissions. Additionally, Intel has submitted MLPerf results for four generations of Xeon products, starting in 2020, and Xeon is the host CPU for many accelerator submissions.

How to Try AI Solutions on Intel Developer Cloud: 5th Gen Xeon processors and Intel Gaudi 2 accelerators are available for evaluation in the Intel Developer Cloud. In this environment, users can run both small- and large-scale training (LLM or GenAI) and inference production workloads at scale, manage AI compute resources and more.

Razer Officially Launches PC Remote Play

ASUS Republic of Gamers Announces New Gaming Peripherals

Razer Expands Premium Laptop Accessory Range with New Adjustable Aluminium Stand

CORSAIR Launches New Web-Based Firmware Update Utility, Enabling Updates Without Additional Software

Cooler Master MasterHUB Review

Elgato Stream Deck + and XLR Dock Bundle Review

DJI Osmo Pocket 3 Creator Combo Review

Elgato Key Light Neo Review

The Funky Kit Show LIVE Ep.335 – Gigabyte B850 AORUS Elite WiFi7, GeForce RTX 5060…

Prize Giveaway #207 – Win a ASRock Z790 LiveMixer Motherboard

Our Podcast Show Ep.125 – Apple WWDC Rumours & Garmin Pisses Off Users

The Funky Kit Show LIVE Ep.334 – TRYX Panorama SE 360 AIO, Nintendo Switch 2,…

Gigabyte B860 AORUS ELITE WiFi7 ICE Motherboard Review

Gigabyte B850 AORUS ELITE WiFi7 Motherboard Review

MSI MEG Ai1600T PCIE5 Power Supply Review

TRYX PANORAMA SE 360 ARGB AIO CPU Cooler Review

Prize Giveaway #207 – Win a ASRock Z790 LiveMixer Motherboard

Prize Giveaway #205 – Win a ASRock B650M PG Riptide WiFi Motherboard

Prize Giveaway #204 – Win a Gigabyte Z790 AORUS ELITE X WIFI7 Motherboard

Prize Giveaway #203 – Win a ASRock B650M PG Riptide WiFi Motherboard

Prize Giveaway #202 – Win a ASRock Z790 PG SONIC Motherboard

Computex 2024: MSI Cubi NUC, MEG Vision X AI

Computex 2024: Adata, Deepcool, Enermax, Noctua, Raijintek, TeamGroup

Computex 2024: Day 4 – Asus

Computex 2024: Day 4 – Thermaltake

Intel Gaudi 2 – Alternative to NV H100 for Generative AI Performance

Winston

Leave a Comment Cancel Reply

Gigabyte B860 AORUS ELITE WiFi7 ICE Motherboard Review

APNX V1 PC Chassis Review (including APNX fans and PC build)

Gigabyte B850 AORUS ELITE WiFi7 Motherboard Review

MSI MEG Ai1600T PCIE5 Power Supply Review

Crucial P310 2TB NVMe M.2 2230 SSD Review

Colorful GeForce RTX 5070 NB EX 12GB-V Graphics Card Review

GIGABYTE Debuts GeForce RTX 5060 Ti & 5060 with Advanced Cooling System for Ultimate Gaming and AI

PNY Unveils NVIDIA GeForce RTX 5060 Family of Graphics Cards

ZOTAC GAMING Announces GeForce RTX 5060 Graphics Card Series

CORSAIR Launches Upgraded HXi Series PSUs with Enhanced Cables and Dual-Color 12V-2×6...

MSI Releases the Custom NVIDIA GeForce RTX 5060 Series Graphics Cards

Acer Debuts Nitro Gaming PCs Featuring Latest NVIDIA GeForce RTX 50 Series...

Intel Gaudi 2 – Alternative to NV H100 for Generative AI Performance

Related posts

Leave a Comment Cancel Reply