老潘的AI社区
模型推理稀疏化加速
部署不内卷
稀疏化
imoldpan
2023 年7 月 22 日 08:52
1
参考
https://developer.nvidia.com/blog/sparsity-in-int8-training-workflow-and-best-practices-for-tensorrt-acceleration/
https://developer.nvidia.com/blog/accelerating-inference-with-sparsity-using-ampere-and-tensorrt/