Intel Data Center GPU Max NEXT vs NVIDIA CMP 70HX

Comparison of Intel Data Center GPU Max NEXT with 128 GB HBM2e and 20,480 cores vs NVIDIA CMP 70HX with 8 GB GDDR6X and 3,840 cores.

Loading...

Performance Rating

Intel Data Center GPU Max NEXT outperforms NVIDIA CMP 70HX by 709.91% in the overall GPU ARK performance rating

A100 A100
H200 H200
MI325X MI325X

Intel Data Center GPU Max NEXT

62.1

Intel Data Center GPU Max NEXT

62.1
RX 7900 XTX RX 7900 XTX
MI250 MI250
Instinct MI300X Instinct MI300X

NVIDIA CMP 70HX

7.7

NVIDIA CMP 70HX

7.7

Contents:

Memory ML Performance Compute Power Architecture & Compatibility ML Software Support Clocks & Performance Power Consumption Rendering Benchmarks Additional

Memory

Memory Size

128 ГБ
🔥 8 ГБ

Memory Type

HBM2e GDDR6X

Memory Bandwidth

🔥 3.21 TB/s
608.3 GB/s

Memory Bus Width

8,192 бит 256 бит

ML Performance

FP16 (Half Precision)

65.54 TFLOPS
🔥 10.71 TFLOPS

BF16 (Brain Float)

No No

TF32 (TensorFloat)

No No

Compute Power

FP32 (Single Precision)

65.54 TFLOPS
🔥 10.71 TFLOPS

FP64 (Double Precision)

65.54 TFLOPS
🔥 0.1674 TFLOPS

CUDA Cores

20,480
🔥 3,840

RT Cores

160
🔥 30

Architecture & Compatibility

GPU Architecture

Generation 12.5 Ampere

SM (Streaming Multiprocessor)

No
🔥 30

PCIe Version

PCIe 5.0 x16 PCIe 1.0 x4

ML Software Support

CUDA Version

No 8.6

Clocks & Performance

Base Clock

900
🔥 +52% 1,365

Boost Clock

1,600
🔥 1,395

Memory Clock

1,565
🔥 1,188

Power Consumption

TDP/TGP

800 W unknown

Recommended PSU

1200 W
🔥 -83% 200 W

Power Connector

8-pin EPS 1x 12-pin

Rendering

Texture Units (TMU)

1,280
🔥 120

ROP

160
🔥 30

L2 Cache

408 MB
🔥 4 MB

Benchmarks

MLPerf, llama2-70b-99.9 (UINT4)

527.5 tokens/s

MLPerf, llama3.1-8b (UINT4)

1 337 tokens/s

MLPerf, llama3.1-8b-edge (UINT4)

1 652 tokens/s

llama.cpp, llama-2-7b-Q4_0

8.21 tokens/s

Additional

Slots

OAM Module Dual-slot

Release Date

No July 16, 2021

Display Outputs

No outputs
No outputs

Renting is cheaper than buying