老潘的AI社区
FP8和INT8?
部署不内卷
tensorrt
imoldpan
2024 年4 月 18 日 09:36
1
image
1308×648 20.8 KB
参考
NVIDIA TensorRT Accelerates Stable Diffusion Nearly 2x Faster with 8-bit Post-Training Quantization | NVIDIA Technical Blog
https://www.reddit.com/r/StableDiffusion/comments/1baeo5h/nvidia_tensorrt_int8_fp8_quantization/