Distributed policy

In the following notebooks, we train a distributed policy shared between the two agents.

As in the previous notebooks, the policy computes (linear) accelerations. We test different observation spaces, action spaces, and training algorithms.
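
To make "distributed policy shared between the two agents" concrete, here is a minimal sketch of the idea: a single set of parameters is evaluated independently by each agent on its own local observation. Everything in it is an assumption made for illustration, not the code used in the notebooks: the class `SharedLinearPolicy`, the helper `act_all`, the observation size of 3, the scalar action, and the acceleration bound are all hypothetical, and a randomly initialized linear map stands in for a trained policy.

```python
import numpy as np


class SharedLinearPolicy:
    """Hypothetical stand-in for a trained policy: one set of weights for all agents."""

    def __init__(self, obs_dim, max_acceleration=1.0, seed=0):
        rng = np.random.default_rng(seed)
        # Random weights replace the result of training (assumption).
        self.weights = rng.normal(scale=0.1, size=obs_dim)
        self.max_acceleration = max_acceleration

    def act(self, observation):
        # Map one agent's local observation to a bounded scalar linear acceleration.
        raw = float(np.dot(self.weights, observation))
        return float(np.clip(raw, -self.max_acceleration, self.max_acceleration))


def act_all(policy, observations):
    # Distributed execution: each agent evaluates the same policy
    # on its own observation, without access to the other agent's input.
    return [policy.act(obs) for obs in observations]


policy = SharedLinearPolicy(obs_dim=3)
observations = [
    np.array([0.5, -0.2, 1.0]),   # local observation of agent 0 (made-up values)
    np.array([-0.5, 0.2, 1.0]),   # local observation of agent 1 (made-up values)
]
accelerations = act_all(policy, observations)
```

Because the parameters are shared, two agents that receive the same observation compute the same acceleration; any difference in behavior comes only from differences in what each agent observes.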