TensorRT: Int8 calibration with hand-tuned scale factors

I’m looking to do INT8 calibration for a particular model, but the scales generated by calibration are not giving adequate accuracy. Is there a way to hand-tune the scales?

I’ve noticed that the cached calibration file contains a hex value for each layer. Can you explain what these values represent? That would make it possible to tune them by hand for improved model accuracy.

Thanks!

Hi,

Here is a useful tutorial for INT8 on TensorRT:
http://on-demand.gputechconf.com/gtc/2017/presentation/s7310-8-bit-inference-with-tensorrt.pdf

You can also check this source for calibrator information:
/usr/src/tensorrt/samples/sampleINT8/LegacyCalibrator.h

Thanks.

Thanks for the fast response and the additional resources! I have actually already implemented calibration with TensorRT. Can you explain what the output values mean?

The cached calibration file contains a set of 64-bit hex values. How are these used as scales?

Hmm, sorry, it’s actually a 32-bit hex value. It looks like it represents a float. Is this the scale created by calibration?
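
If so, each value may just be the raw bit pattern of the float. A minimal decoding sketch (assuming the hex is the big-endian IEEE 754 encoding of a float32; the example value below is made up, not taken from my model):

```python
import struct

# Decode one cached value, assuming the hex string is the big-endian
# IEEE 754 bit pattern of a float32. The example value is made up.
hex_value = "3c010a14"
scale = struct.unpack("!f", bytes.fromhex(hex_value))[0]
print(scale)  # ~0.00787, i.e. roughly 1/127
```

Decoding a few entries this way should show whether they line up with plausible activation scales.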

Hi,

Please check this page for information:

Thanks.

Same question!
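
For anyone landing here with the same question: as far as I can tell, each non-header line of the cache has the form tensor_name: hexvalue, where the hex is the big-endian IEEE 754 bit pattern of the float32 scale for that tensor. Hand-tuning then amounts to encoding a new float and writing it back. A minimal sketch (the file path, tensor name, and new value are placeholders, not anything TensorRT prescribes):

```python
import struct

def float_to_hex(value):
    """Encode a float32 as the 8-character hex string used in the cache."""
    return struct.pack("!f", value).hex()

def retune_scale(cache_path, tensor_name, new_scale):
    """Rewrite one tensor's scale in a calibration cache, in place.

    Assumes a text file with a header line followed by
    "name: hexvalue" lines; all arguments are user-chosen placeholders.
    """
    with open(cache_path) as f:
        lines = f.read().splitlines()

    for i, line in enumerate(lines):
        if line.startswith(tensor_name + ":"):
            lines[i] = f"{tensor_name}: {float_to_hex(new_scale)}"

    with open(cache_path, "w") as f:
        f.write("\n".join(lines) + "\n")

# Hypothetical usage: widen one tensor's scale by 10%, then rebuild
# the engine so it picks up the edited cache.
# retune_scale("calibration.cache", "conv1", 0.00787 * 1.1)
```

An alternative to editing the cache is to set per-tensor dynamic ranges programmatically with ITensor::setDynamicRange and skip calibration entirely.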