Distributed policy
In the following notebooks, we train a distributed policy shared between the two agents.
As in the previous notebooks, the policy computes (linear) accelerations. We test different observation spaces, action spaces, and training algorithms.
🟧 -> 🔶