Abstract: Deep neural networks (DNNs) are widely used in many fields, such as artificial intelligence generated content (AIGC) and robotics. To efficiently support these tasks, the model pruning ...
To support int8 model deployment on mobile devices,we provide the universal post training quantization tools which can convert the float32 model to int8 model. mean and norm are the values you passed ...