Refactor: removing scaling nodes

Added MulPTQ Operator and ScalingMeta for PTQ Module

This MR introduces the Scaling operator, which wraps a MUL operator inside a MetaOperator. The PTQ (Post-Training Quantization) module requires this operator.
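To illustrate the idea of a MUL wrapped in a meta-operator, here is a minimal, self-contained sketch. It does not use the project's real API: the `Op` and `MetaOp` classes and the `mul` and `scaling` helpers are hypothetical names invented for this example, and plain Python lists stand in for tensors.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical, illustration-only types; these are NOT the project's real classes.


@dataclass
class Op:
    """A primitive operator: a name plus an element-wise function."""
    name: str
    fn: Callable[[List[float]], List[float]]

    def forward(self, x: List[float]) -> List[float]:
        return self.fn(x)


@dataclass
class MetaOp:
    """A meta-operator: a named chain of primitive operators."""
    name: str
    ops: List[Op] = field(default_factory=list)

    def forward(self, x: List[float]) -> List[float]:
        # Run each wrapped primitive in order.
        for op in self.ops:
            x = op.forward(x)
        return x


def mul(factor: float) -> Op:
    """Primitive multiply-by-constant operator."""
    return Op("Mul", lambda x: [v * factor for v in x])


def scaling(factor: float) -> MetaOp:
    """Assumed shape of a scaling meta-operator: a single wrapped Mul."""
    return MetaOp("Scaling", [mul(factor)])


node = scaling(0.5)
result = node.forward([2.0, 4.0, -6.0])
inner = [op.name for op in node.ops]  # primitives hidden behind the wrapper
print(result, inner)
```

In this sketch the wrapper contains nothing but a multiplication, so a graph exporter could in principle expand it back into its primitive and emit only a standard Mul node.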

Additionally, the Quantizer operator has been added to replace the now-deprecated ScalingNodes. The Quantizer acts as a meta-operator that handles scaling operations in the PTQ, which makes it possible to remove the Scaling Node Operator (which is not compatible with ONNX).

Caution: This Merge Request aims to eventually replace the existing Scaling operator (which is not ONNX-compatible). As a result, the current operator is now considered deprecated.

For more details, please refer to the associated issue: #43 (closed)

Edited by Noam Zerah
