When applying controller checks to supervisors, the finite response check does not complete. I can get it working for smaller models (with 4 states), but for larger models (on the order of 10^20 states) it does not finish: after an hour it is still running.
This is curious, since in version 0.6 the finite response check for the same model completed in seconds.
In version 0.7, a confluence check was added. Perhaps this is the cause of the performance degradation.
Related issues
This issue fixed the regression, but not the performance issues for the confluence check. See also:
Issue #686 (closed) is about adding more debug/progress output, and more termination checks.
Issue #693 (closed) is about making the conversion of updates to MDDs more efficient.
Issue #695 (closed) is about making the check more efficient in general, as well as discussing BDDs, MDDs, and their implementations as it relates to these checks.
This issue can be reproduced using the bridge example in our CIF examples repository. After synthesizing a supervisor using do1_synthesize.tooldef, checking finite response with ESCET v0.6 concludes within a few seconds that there is no finite response. Checking finite response (with confluence checking disabled in the tool options) does not terminate with ESCET v0.10.
I also notice now that the user-triggered termination is not caught by the tool, so I had to force-close ESCET v0.10.
The problem is somewhere in the PrepareChecks class (so it doesn't matter which, if any, controller check is selected). When I run the checks in debug mode, I eventually get stuck at this line.
It seems that tree.conjunct takes more and more time with each iteration of the for loop over the variables.
Looking at the titles of the merge requests that are part of %v0.7, I think we indeed need to look at !351 (merged), where the confluence check was added to the CIF controller property checker tool.
I dug a bit into !351 (merged). Some observations:
The line where @mgoorden7u4 indicates the algorithm gets stuck was introduced here.
The class was renamed, and the method changed a bit, here. The changes to the update method don't seem consequential.
The for loop in the update method where it gets stuck is where identity updates are added for all variables that are not assigned on the edge. This prevents them from changing value arbitrarily. We do this in data-based synthesis as well.
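To make the idea concrete, here is a toy sketch of that identity-update loop. It is not ESCET code: a relation over 3 boolean variables is represented as an explicit set of "pre->post" pairs, and "conjunct" is just set intersection; all names are invented for this illustration.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.function.BiPredicate;

public class IdentityUpdates {
    static final int N = 3; // number of boolean variables

    // Enumerate all (pre, post) state pairs satisfying a predicate.
    static Set<String> relation(BiPredicate<int[], int[]> pred) {
        Set<String> rel = new HashSet<>();
        for (int a = 0; a < (1 << N); a++) {
            for (int b = 0; b < (1 << N); b++) {
                if (pred.test(bits(a), bits(b))) {
                    rel.add(a + "->" + b);
                }
            }
        }
        return rel;
    }

    static int[] bits(int v) {
        int[] r = new int[N];
        for (int i = 0; i < N; i++) {
            r[i] = (v >> i) & 1;
        }
        return r;
    }

    // The edge assigns only x0 := 1; x1 and x2 are left unassigned, so
    // without further constraints they may change value arbitrarily.
    static Set<String> edgeUpdate() {
        return relation((pre, post) -> post[0] == 1);
    }

    // Conjoin an identity update (xi' == xi) for each unassigned variable,
    // mirroring the role of the for loop discussed above.
    static Set<String> addIdentities(Set<String> update) {
        for (int i = 1; i < N; i++) {
            final int var = i;
            update.retainAll(relation((pre, post) -> pre[var] == post[var]));
        }
        return update;
    }

    public static void main(String[] args) {
        Set<String> update = edgeUpdate();
        System.out.println("pairs before identities: " + update.size());
        System.out.println("pairs after identities: " + addIdentities(update).size());
    }
}
```

Before the loop, the 2 unassigned variables are unconstrained (32 pairs); after it, each pre-state has exactly one post-state (8 pairs). In the real tool the relations are MDDs rather than explicit sets, and each conjunct call operates on an ever-larger decision diagram, which is where the time goes.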
We noted that for BDDs in the data-based synthesis tool, the order in which we add the identity updates can matter a lot, performance-wise. Also, if you have a lot of variables, the result can become large. The controller checker uses MDDs, not BDDs. I assume, though, that these performance characteristics likely hold for MDDs as well. Also note that our implementation of MDDs is nowhere near as optimized as the BDD library that we use. So, we may be able to speed this up. However, none of this matters, as we'll see below.
When we introduced the confluence check, some of the computations common to the confluence check and the finite response check were moved from the finite response checker to a PrepareChecks class. However, computing updates is new for the finite response checker, as it did not have that yet; see here for the PrepareChecks class as introduced then, and here for the parts of the finite response checker that were (re)moved (GitLab may not scroll to the right place by itself).
PrepareChecks has the 'problematic' update method where execution seems to stall. The computation within the update method is returned from the method. The caller, processAutomaton, uses it to determine guardedUpdate, which is subsequently used to update autGuardedUpdates and globalGuardedUpdatesByEvent. globalGuardedUpdatesByEvent is a private field that can only be obtained via the public method getGlobalGuardedUpdatesByEvent. ConfluenceChecker invokes that method; FiniteResponseChecker does not.
So, my preliminary conclusion is that the confluence check needs expensive computations, and these were added to the PrepareChecks class that it shares with the finite response check. The finite response check does not need these extra computations, but they are nonetheless executed, leading to a severe performance regression.
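One way to avoid the shared-class cost would be to compute the guarded updates lazily, so that only checks that actually request them (the confluence check) pay for them. A minimal sketch of that pattern, with all names invented and the expensive step stubbed out:

```java
import java.util.function.Supplier;

public class LazyPrepareChecks {
    private final Supplier<String> computeGuardedUpdates; // the expensive step
    private String globalGuardedUpdatesByEvent; // cached result, null until needed
    int computations = 0; // counts how often the expensive step runs

    LazyPrepareChecks(Supplier<String> computeGuardedUpdates) {
        this.computeGuardedUpdates = computeGuardedUpdates;
    }

    // Only the confluence check would call this getter; the finite response
    // check never triggers the computation.
    String getGlobalGuardedUpdatesByEvent() {
        if (globalGuardedUpdatesByEvent == null) {
            computations++;
            globalGuardedUpdatesByEvent = computeGuardedUpdates.get();
        }
        return globalGuardedUpdatesByEvent;
    }

    public static void main(String[] args) {
        LazyPrepareChecks prep = new LazyPrepareChecks(() -> "guarded updates");
        // Finite-response-style use: getter never called, zero computations.
        System.out.println("computations: " + prep.computations);
        // Confluence-style use: first call computes, second hits the cache.
        prep.getGlobalGuardedUpdatesByEvent();
        prep.getGlobalGuardedUpdatesByEvent();
        System.out.println("computations: " + prep.computations);
    }
}
```

This is just one option; simply moving the update computation out of the shared preparation code and back into the confluence check would achieve the same effect.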
@ahofkamp Do you remember why you added the updates to the shared class, while they are not needed for the finite response check? Could we just separate that again?
@koenveldik Thanks for reporting the issue. We solved it. Can you wait for the next Eclipse ESCET release at the end of September, or do you need this more urgently?
Dennis Hendriks changed title from Performance regression in finite response checker to Performance regression in finite response check after adding confluence check