Quant
UP supports the whole pipline of training and interfering;
Configs
It contains the illustration of common configs.
Quant setting
quant: ptq_only: False deploy_backend: tensorrt cali_batch_size: 900 prepare_args: extra_qconfig_dict: w_observer: MinMaxObserver a_observer: EMAMinMaxObserver w_fakequantize: FixedFakeQuantize a_fakequantize: FixedFakeQuantize leaf_module: [Space2Depth, FrozenBatchNorm2d] extra_quantizer_dict: additional_module_type: [ConvFreezebn2d, ConvFreezebnReLU2d]