Multiple fixes to enable multi-GPU forward execution
Some of these fixes enable multi-GPU forward execution of cloned graphs.
- Correctly escape HTML special characters in `GraphView::save()`;
- Added a `const` version of `refCastFrom()`;
- Make `GraphView::add()` from another GraphView preserve ordered inputs/outputs (see the sketch after this list);
- Operator attributes were not properly cloned (should fix aidge!105 (merged)).
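For the `GraphView::add()` ordering fix, a minimal sanity check could look like the sketch below. This is an illustration only: `get_ordered_inputs()`/`get_ordered_outputs()` are assumed to be the Python bindings of the corresponding C++ getters, and `mymodel.onnx` is a placeholder model path.

```python
import aidge_core
import aidge_onnx

model = aidge_onnx.load_onnx("mymodel.onnx")  # placeholder model path

# Add a whole GraphView into another one.
wrapper = aidge_core.GraphView()
wrapper.add(model)

# Assumed bindings of the C++ getOrderedInputs()/getOrderedOutputs();
# the fix ensures the wrapper keeps the original ordering.
assert wrapper.get_ordered_inputs() == model.get_ordered_inputs()
assert wrapper.get_ordered_outputs() == model.get_ordered_outputs()
```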
This now works:
```python
import aidge_core
import aidge_onnx

aidge_model0 = aidge_onnx.load_onnx("mymodel.onnx")
aidge_model1 = aidge_model0.clone()
aidge_model0.set_backend("cuda", 0)  # run the original model on GPU 0
aidge_model1.set_backend("cuda", 1)  # run the clone on GPU 1

# Gather both models in a single GraphView so that one scheduler
# can drive them together.
aidge_models = aidge_core.GraphView()
aidge_models.add(aidge_model0)
aidge_models.add(aidge_model1)

scheduler = aidge_core.ParallelScheduler(aidge_models)
scheduler.forward(True, [in_model0, in_model1])
```
The models `aidge_model0` and `aidge_model1` will be run in parallel on 2 GPUs, yet in a synchronized fashion (the same layers are executed at the same time). This should be generalizable to any number of device/backend combinations (see the sketch below)!
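As a rough illustration of that generalization, the same pattern can be written for N replicas. This is a sketch under stated assumptions: `NUM_DEVICES` CUDA devices are available, and `inputs` is a placeholder list with one input tensor per replica, prepared elsewhere.

```python
import aidge_core
import aidge_onnx

NUM_DEVICES = 4  # assumption: 4 CUDA devices are available

base_model = aidge_onnx.load_onnx("mymodel.onnx")

# One clone per device, each bound to its own CUDA device index.
models = [base_model] + [base_model.clone() for _ in range(NUM_DEVICES - 1)]
for device_id, model in enumerate(models):
    model.set_backend("cuda", device_id)

# Merge all replicas into a single GraphView driven by one scheduler.
replicas = aidge_core.GraphView()
for model in models:
    replicas.add(model)

scheduler = aidge_core.ParallelScheduler(replicas)
scheduler.forward(True, inputs)  # `inputs`: one tensor per replica, in order
```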
This also works:
```python
import concurrent.futures

import aidge_core
import aidge_onnx

aidge_model0 = aidge_onnx.load_onnx("mymodel.onnx")
aidge_model1 = aidge_model0.clone()
aidge_model0.set_backend("cuda", 0)  # run the original model on GPU 0
aidge_model1.set_backend("cuda", 1)  # run the clone on GPU 1

# One independent scheduler per model.
scheduler0 = aidge_core.SequentialScheduler(aidge_model0)
scheduler1 = aidge_core.SequentialScheduler(aidge_model1)

# Run both forward passes concurrently, one per worker thread.
with concurrent.futures.ThreadPoolExecutor(max_workers=2) as e:
    e.submit(scheduler0.forward, True, [in_model0])
    e.submit(scheduler1.forward, True, [in_model1])
```
Here the models `aidge_model0` and `aidge_model1` are run in parallel, fully asynchronously.
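One caveat, not specific to Aidge: `ThreadPoolExecutor.submit()` returns a `Future`, and an exception raised inside `forward()` stays hidden until the future's result is consumed. A variant of the loop above that surfaces such errors:

```python
with concurrent.futures.ThreadPoolExecutor(max_workers=2) as e:
    futures = [
        e.submit(scheduler0.forward, True, [in_model0]),
        e.submit(scheduler1.forward, True, [in_model1]),
    ]
    for future in futures:
        # result() re-raises any exception thrown by the forward pass
        # instead of silently swallowing it.
        future.result()
```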