Add support for the MatMul operator
Description
The goal of this MR is to allow the quantization of models containing `MatMul` operators, using the PTQ pipeline.
It is important to note that there are in fact two very different cases where the `MatMul` operator is used:

- The first one is to represent an `FC` node which has no bias. To handle this case without adding complexity to the PTQ pipeline, we can use the `fuseMatMultoFC()` recipe. But we must first ensure that the weight is connected to the input 1 of the `MatMul` node (and not the input 0). That's why a `reorderMatMulInputs()` recipe is needed.
- The second one is the case where the two inputs of the `MatMul` are actual data (i.e. not weights). In this case we need to modify the different steps of the PTQ pipeline to ensure that the scaling ratios flow correctly through the graph. The general idea is to multiply the two input scaling ratios coming from the branches that are merged by the `MatMul` operator (see the sketch after this list).
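To make the general idea concrete, here is a minimal, self-contained sketch (plain C++ with made-up variable names, not actual Aidge code) of why the product of the two input ratios is the one that keeps flowing past the node:

```cpp
// Because MatMul is bilinear, scaling its inputs by s0 and s1 scales its
// output by s0 * s1: MatMul(s0 * A, s1 * B) == (s0 * s1) * MatMul(A, B).
#include <iostream>

int main() {
    // Hypothetical accumulated scaling ratios of the two merged branches:
    double ratio0 = 0.5;   // ratio flowing in through input 0
    double ratio1 = 0.25;  // ratio flowing in through input 1

    // The ratio that keeps flowing after the MatMul is their product:
    double outputRatio = ratio0 * ratio1;
    std::cout << "propagated ratio = " << outputRatio << '\n'; // 0.125
    return 0;
}
```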
TODO
To handle the first case:

- modify the `isAffine()` function to catch `MatMul` nodes connected to a weight Tensor (using the `isWeighted` tag)
- create the `reorderMatMulInputs()` recipe that ensures that the weight `Producer` is connected to input 1 (sketched below)
- handle `MatMul` nodes that are connected to a weight without replacing them with `FC` nodes
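As a reference point for the `reorderMatMulInputs()` item, here is a toy sketch of the logic the recipe could implement. The types and the `isWeightProducer` flag are stand-ins invented for this example; the real recipe works on Aidge's graph API, whose exact calls are not reproduced here:

```cpp
// Toy types invented for this sketch; not the actual Aidge API.
#include <array>
#include <memory>
#include <utility>

struct ToyNode {
    bool isWeightProducer = false;                  // stand-in for the isWeighted tag
    std::array<std::shared_ptr<ToyNode>, 2> inputs; // inputs 0 and 1 of a MatMul
};

// Ensure the weight Producer feeds input 1, so that fuseMatMultoFC() can
// later turn the (bias-free) MatMul into an FC node.
void reorderMatMulInputs(ToyNode& matMul) {
    auto& in0 = matMul.inputs[0];
    auto& in1 = matMul.inputs[1];
    // Swap only when the weight sits on input 0 and input 1 carries data.
    // NOTE: a real implementation must also handle operand transposition,
    // since A * B != B * A for matrices; that detail is omitted here.
    if (in0 && in0->isWeightProducer && in1 && !in1->isWeightProducer) {
        std::swap(in0, in1);
    }
}
```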
To handle the second case:

- modify the `isMerging()` function to catch `MatMul` nodes not connected to a weight Tensor (using the `isWeighted` tag)
- modify the `normalizeParameters()` and `normalizeActivations()` functions to multiply the two input accumulated ratios
- modify the `quantizeNormalizeNetwork()` function to rescale the `MatMul` scaling twice as much, since both of its inputs are quantized (see the sketch after this list)
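The following toy sketch (hypothetical values and names, not the actual Aidge implementation) illustrates the two rules above, under the assumption that "twice as much" refers to the signed-max factor 2^(nbBits-1) - 1 being accumulated once per quantized input:

```cpp
#include <cmath>
#include <iostream>

int main() {
    // Rule 1 -- normalizeParameters() / normalizeActivations():
    // a merging MatMul receives an accumulated ratio on each input branch
    // and forwards their product (MatMul is bilinear in its inputs).
    double branchRatio0 = 0.5;  // hypothetical accumulated ratio, input 0
    double branchRatio1 = 0.8;  // hypothetical accumulated ratio, input 1
    double mergedRatio  = branchRatio0 * branchRatio1;

    // Rule 2 -- quantizeNormalizeNetwork():
    // each quantized input is expanded by signedMax = 2^(nbBits-1) - 1,
    // so the MatMul output carries that factor twice; its scaling must
    // therefore be rescaled twice as much to compensate.
    int nbBits = 8;
    double signedMax = std::pow(2.0, nbBits - 1) - 1.0; // 127 for 8 bits
    double matMulCompensation = 1.0 / (signedMax * signedMax);

    std::cout << "merged ratio        = " << mergedRatio << '\n';
    std::cout << "MatMul compensation = " << matMulCompensation << '\n';
    return 0;
}
```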
Overall:

- test and validate the changes on several network topologies
Files changed: mostly `PTQ.cpp`, but also various PTQ-related files (`CLE.cpp`, headers, ...)
Note: several other changes were made to improve the code quality (e.g. `hasAttr()`, `addAttr()`, ...)
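For illustration, a guessed shape of these helpers (the signatures are hypothetical; only the intent of tagging nodes, e.g. with `isWeighted`, is shown):

```cpp
// Hypothetical helper shapes, invented for this sketch.
#include <memory>
#include <set>
#include <string>

struct ToyNode {
    std::set<std::string> attrs; // stand-in for a node's attribute storage
};

void addAttr(const std::shared_ptr<ToyNode>& node, const std::string& attr) {
    node->attrs.insert(attr);
}

bool hasAttr(const std::shared_ptr<ToyNode>& node, const std::string& attr) {
    return node->attrs.count(attr) != 0;
}

// Usage idea: tag the weight producers once, then query the tag everywhere
// (e.g. in isAffine() / isMerging()) instead of re-deriving the property.
```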