Global Quantization Improvements
Compare changes
This merge request aims to rework several parts of the existing PTQ and QAT code. The provided new changes don't add extra features to the quantization module, but rather enhance the implementation of the existing ones. Here are the lists of changes.
insertCompensationNodes()
routinegetTensorAbsMax()
rescaleTensor()
roundTensor()
float
to double
operator
folderaddBeforeForward()
instead of adding a calibration step in the workflowsetupQuantizer()
getTensorAbsMean()
getTensorStd()
Also, all the std::cout
verbose logs have been replaced with Log
ones.
Copyright © Eclipse Foundation, Inc. All Rights Reserved. Privacy Policy | Terms of Use | Copyright Agent