GPUs for Running Falcon-40B

| GPU | Inference speed relative to 2x H100s (est.) | Speed / $ (relative, est.) | Cost at Runpod | Cost at FluidStack | Cost at Lambda Labs |
| --- | --- | --- | --- | --- | --- |
| 2x H100s | 100% | Not available | Not available | Not available in an on-demand 2x instance | Not available in an on-demand 2x instance |
| 2x 6000 Ada | 48% | 0.20 | ✅ $2.38 | Not available | Not available |
| 2x L40 | 43% | 0.18 | $2.38 | Not available | Not available |
| 2x A100 80GB | 43% | 0.12 | $3.58 | $4.99 | Not available |
| 2x A6000 | 19% | 0.12 | ✅ $1.58 | ✅ $1.60 | ✅ $1.60 |
| 2x A40 | 19% | 0.12 | $1.58 | $3.19 | Not available |
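
The source does not state how the Speed / $ column is derived. The listed values are consistent with dividing the relative inference speed (as a fraction of 2x H100s) by the Runpod price, so the sketch below assumes that formula. The function names `speed_per_dollar` and `rank_by_value` are hypothetical and exist only for illustration; the numbers are copied from the table.

```python
# Rows copied from the table: (GPU, speed relative to 2x H100s, Runpod cost in USD).
# 2x H100s is omitted because no price is listed for it.
ROWS = [
    ("2x 6000 Ada", 0.48, 2.38),
    ("2x L40", 0.43, 2.38),
    ("2x A100 80GB", 0.43, 3.58),
    ("2x A6000", 0.19, 1.58),
    ("2x A40", 0.19, 1.58),
]


def speed_per_dollar(relative_speed, cost):
    """Assumed formula: relative speed divided by price, rounded to 2 places."""
    return round(relative_speed / cost, 2)


def rank_by_value(rows):
    """Return (GPU, speed per dollar) pairs, best value first."""
    scored = [(gpu, speed_per_dollar(speed, cost)) for gpu, speed, cost in rows]
    # sorted() is stable, so GPUs with equal scores keep their table order.
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for gpu, value in rank_by_value(ROWS):
        print(f"{gpu}: {value:.2f}")
```

Under this assumption the ranking matches the table: 2x 6000 Ada offers the most speed per dollar, and the A100 80GB, A6000, and A40 pairs tie despite very different absolute speeds.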