NVIDIA A100 PCIe vs NVIDIA A40 PCIe

We compared an AI GPU, the 40 GB VRAM A100 PCIe, with a Desktop platform GPU, the 48 GB VRAM A40 PCIe, to see which has better performance in key specifications, benchmark tests, power consumption, and more.

Main Differences

NVIDIA A100 PCIe's Advantages
Higher memory bandwidth (1555 GB/s vs 695.8 GB/s)
Lower TDP (250 W vs 300 W)
NVIDIA A40 PCIe's Advantages
23% higher boost clock (1740 MHz vs 1410 MHz)
More VRAM (48 GB vs 40 GB)
3840 more shading units (10752 vs 6912)
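
The percentages and differences quoted above follow from simple ratios of the listed figures. The sketch below recomputes them; the helper name `percent_higher` is made up for illustration, and the input numbers are copied from this page.

```python
def percent_higher(a: float, b: float) -> float:
    """Return how much higher a is than b, as a percentage of b."""
    return (a / b - 1.0) * 100.0


# Figures copied from the comparison on this page.
boost_clock = percent_higher(1740, 1410)   # A40 vs A100 boost clock, MHz
vram = percent_higher(48, 40)              # A40 vs A100 memory size, GB
bandwidth = percent_higher(1555, 695.8)    # A100 vs A40 bandwidth, GB/s
extra_units = 10752 - 6912                 # A40 minus A100 shading units

print(f"A40 boost clock:  +{boost_clock:.0f}%")
print(f"A40 memory size:  +{vram:.0f}%")
print(f"A100 bandwidth:   +{bandwidth:.0f}%")
print(f"A40 extra shading units: {extra_units}")
```

In relative terms, the A100's bandwidth lead is much larger than the A40's clock or capacity lead, which matters for memory-bound workloads.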

Benchmark

| Benchmark | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| FP32 (float) | 19.49 TFLOPS | 37.42 TFLOPS (+91%) |

Graphics Card

| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| Release Date | Jun 2020 | Oct 2020 |
| Generation | Tesla | Tesla |
| Type | AI GPU | Desktop |
| Bus Interface | PCIe 4.0 x16 | PCIe 4.0 x16 |

Clock Speeds

| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| Base Clock | 765 MHz | 1305 MHz |
| Boost Clock | 1410 MHz | 1740 MHz |
| Memory Clock | 1215 MHz | 1812 MHz |

Memory

| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| Memory Size | 40 GB | 48 GB |
| Memory Type | HBM2e | GDDR6 |
| Memory Bus | 5120-bit | 384-bit |
| Bandwidth | 1555 GB/s | 695.8 GB/s |
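
The bandwidth figures can be reproduced from the memory bus width and the memory clock listed under Clock Speeds. The sketch below is a rough check, not a vendor formula: the function name and the transfers-per-clock multipliers (2 for HBM2e, 8 for GDDR6 relative to the listed clock) are assumptions chosen because they match the numbers on this page.

```python
def bandwidth_gb_s(bus_width_bits: int, memory_clock_mhz: float,
                   transfers_per_clock: int) -> float:
    """Peak memory bandwidth in GB/s (1 GB = 1e9 bytes)."""
    bytes_per_transfer = bus_width_bits / 8
    transfers_per_second = memory_clock_mhz * 1e6 * transfers_per_clock
    return bytes_per_transfer * transfers_per_second / 1e9


# Assumption: HBM2e moves 2 transfers per listed memory clock.
a100_bw = bandwidth_gb_s(5120, 1215, 2)
# Assumption: GDDR6 moves 8 transfers per listed memory clock.
a40_bw = bandwidth_gb_s(384, 1812, 8)

print(f"A100 PCIe: {a100_bw:.1f} GB/s")
print(f"A40 PCIe:  {a40_bw:.1f} GB/s")
```

The very wide HBM2e bus is what gives the A100 its lead, even though its memory clock is lower than the A40's.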

Render Config

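| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| SM Count | 108 | 84 |
| Compute Units | - | - |
| Shading Units | 6912 | 10752 |
| TMUs | 432 | 336 |
| ROPs | 160 | 112 |
| Tensor Cores | 432 | 336 |
| RT Cores | - | 84 |
| L1 Cache | 192 KB (per SM) | 128 KB (per SM) |
| L2 Cache | 40 MB | 6 MB |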

Theoretical Performance

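| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| Pixel Rate | 225.6 GPixel/s | 194.9 GPixel/s |
| Texture Rate | 609.1 GTexel/s | 584.6 GTexel/s |
| FP16 (half) | 77.97 TFLOPS | 37.42 TFLOPS |
| FP32 (float) | 19.49 TFLOPS | 37.42 TFLOPS |
| FP64 (double) | 9.746 TFLOPS | 584.6 GFLOPS |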
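
These theoretical figures can be derived from the unit counts and the boost clock listed elsewhere on this page. The sketch below assumes one pixel per ROP per clock, one texel per TMU per clock, and two FP32 operations (one fused multiply-add) per shading unit per clock. The `Gpu` class and the FP16/FP64 ratios are illustrative and inferred from the listed numbers, not taken from vendor documentation.

```python
from dataclasses import dataclass


@dataclass
class Gpu:
    name: str
    boost_mhz: float     # Boost Clock from the Clock Speeds section
    shading_units: int
    tmus: int
    rops: int
    fp16_ratio: float    # FP16 rate relative to FP32 (inferred from this page)
    fp64_ratio: float    # FP64 rate relative to FP32 (inferred from this page)

    def pixel_rate(self) -> float:
        """GPixel/s, assuming one pixel per ROP per clock."""
        return self.rops * self.boost_mhz / 1000

    def texture_rate(self) -> float:
        """GTexel/s, assuming one texel per TMU per clock."""
        return self.tmus * self.boost_mhz / 1000

    def fp32_tflops(self) -> float:
        """TFLOPS, assuming 2 FP32 operations per shading unit per clock."""
        return self.shading_units * 2 * self.boost_mhz / 1e6

    def fp16_tflops(self) -> float:
        return self.fp32_tflops() * self.fp16_ratio

    def fp64_tflops(self) -> float:
        return self.fp32_tflops() * self.fp64_ratio


a100 = Gpu("A100 PCIe", 1410, 6912, 432, 160, fp16_ratio=4.0, fp64_ratio=1 / 2)
a40 = Gpu("A40 PCIe", 1740, 10752, 336, 112, fp16_ratio=1.0, fp64_ratio=1 / 64)

for gpu in (a100, a40):
    print(f"{gpu.name}: {gpu.pixel_rate():.1f} GPixel/s, "
          f"{gpu.texture_rate():.1f} GTexel/s, "
          f"FP16 {gpu.fp16_tflops():.2f}, FP32 {gpu.fp32_tflops():.2f}, "
          f"FP64 {gpu.fp64_tflops():.3f} TFLOPS")
```

The ratios show the design split: the A40 leads in FP32, while the A100 keeps much higher FP16 and FP64 throughput.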

Graphics Processor

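| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| GPU Name | GA100 | GA102 |
| GPU Variant | - | - |
| Architecture | Ampere | Ampere |
| Foundry | TSMC | Samsung |
| Process Size | 7 nm | 8 nm |
| Transistors | 54.2 billion | 28.3 billion |
| Die Size | 826 mm² | 628 mm² |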
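
Dividing the transistor count by the die size gives a rough transistor density for each chip. The snippet below is a back-of-the-envelope calculation using only the figures on this page; the function name is made up for illustration.

```python
def density_mtr_per_mm2(transistors_billion: float, die_mm2: float) -> float:
    """Transistor density in millions of transistors per mm²."""
    return transistors_billion * 1000 / die_mm2


ga100_density = density_mtr_per_mm2(54.2, 826)   # A100 PCIe (TSMC 7 nm)
ga102_density = density_mtr_per_mm2(28.3, 628)   # A40 PCIe (Samsung 8 nm)

print(f"GA100: {ga100_density:.1f} MTr/mm²")
print(f"GA102: {ga102_density:.1f} MTr/mm²")
```

By this estimate the GA100 packs roughly 1.5 times as many transistors per mm² as the GA102.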

Board Design

| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| TDP | 250 W | 300 W |
| Suggested PSU | 600 W | 700 W |
| Outputs | No outputs | 3x DisplayPort 1.4a |
| Power Connectors | 8-pin EPS | 8-pin EPS |
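
A lower TDP does not by itself mean better efficiency, so it can help to relate the TDP to the theoretical FP32 throughput listed on this page. The sketch below treats TDP as the power draw, which is an assumption: TDP is a design ceiling, not a measured figure, so the results are only a rough guide.

```python
def gflops_per_watt(tflops: float, tdp_watts: float) -> float:
    """Theoretical GFLOPS per watt, using TDP as a stand-in for power draw."""
    return tflops * 1000 / tdp_watts


# FP32 figures from the Theoretical Performance section, TDP from Board Design.
a100_eff = gflops_per_watt(19.49, 250)
a40_eff = gflops_per_watt(37.42, 300)

print(f"A100 PCIe: {a100_eff:.1f} GFLOPS/W (FP32)")
print(f"A40 PCIe:  {a40_eff:.1f} GFLOPS/W (FP32)")
```

On FP32 alone the A40 comes out ahead per watt despite its higher TDP. The same calculation with the FP16 or FP64 figures would favor the A100.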

Graphics Features

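| Specification | A100 PCIe | A40 PCIe |
| --- | --- | --- |
| DirectX | N/A | 12 Ultimate (12_2) |
| OpenGL | N/A | 4.6 |
| OpenCL | 3.0 | 3.0 |
| Vulkan | N/A | 1.3 |
| CUDA (compute capability) | 8.0 | 8.6 |
| Shader Model | N/A | 6.6 |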

© 2024 - TopCPU.net