Update penalty calculation, dynamic import of gnn
This PR updates how the penalty for the RL is computed in the compute_reward() function and also uses it to assess the matchmaking.
The compute_reward_ga() function in main.py is used only to assess the matchmaking; it is not used to update the GA.
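
For reviewers unfamiliar with the pattern, here is a minimal sketch of what a dynamic import of gnn could look like. It is an illustration only, not the code in this PR: the helper name load_optional_module is hypothetical, and "gnn" is assumed to be an importable module name, as the title suggests. It uses only the standard importlib.import_module call.

```python
import importlib
from types import ModuleType
from typing import Optional


def load_optional_module(name: str) -> Optional[ModuleType]:
    """Import `name` at call time; return None if it cannot be imported.

    Hypothetical helper for illustration; the PR may do this differently.
    """
    try:
        return importlib.import_module(name)
    except ImportError:
        # Covers ModuleNotFoundError as well (it subclasses ImportError).
        return None


# "gnn" is the module name assumed from the PR title.
gnn = load_optional_module("gnn")
if gnn is None:
    print("gnn is not available; GNN-based components are skipped")
```

Importing at call time rather than at the top of the file keeps code paths that do not need gnn usable when the module or its dependencies are not installed.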