A40 PCIe | A30 PCIe | A30X PCIe | |
GPU Architecture | NVIDIA Ampere | NVIDIA Ampere | NVIDIA Ampere |
GPU Memory | 48GB GDDR6 | 24GB HBM2 | 24GB HBM2e |
Memory Bandwidth | 696 GB/sec | 933 GB/sec | 1,223 GB/sec |
NVIDIA CUDA® Cores | 10,752 | 3.584 | 3.584 |
NVIDIA Tensor Cores | 336 | 224 | 224 |
Network | N/A | N/A | 100Gb Dual Port (Ethernet or IB) |
Double-Precision | N/A | 5.2 TFLOPS | 5.2 TFLOPS |
Single-Precision | FP32: 37.4 TFLOPSTF32: 74.8 TFLOPS | FP32: 10.3 TFLOPSTF32: 82 TFLOPS | FP32: 10.3 TFLOPSTF32: 82 TFLOPS |
Tensor Performance | N/A | N/A | N/A |
INT8 | 299.3 TOPS | 330 TOPS | 330 TOPS |
INT4 | 598.7 TOPS | 661 TOPS | 661 TOPS |
Support MIG | No | MAX: 4 MIGs @ 6GB each | MAX: 4 MIGs @ 6GB each |
Networking | N/A | N/A | 100Gb Dual Port (Ethernet or IB) |
Graphics Bus | PCI Express 4.0 x 16 | PCI Express 4.0 x 16 | PCI Express 4.0 x 16 |
Form Factor | Dual Slot | Dual Slot | Dual Slot |
Max TDP Power | 300W | 165W | 230W |