Invariant Causal Imitation Learning for Generalizable Policies
Ioana Bica, Daniel Jarrett, Mihaela van der Schaar Neural Information Processing Systems (NeurIPS) 2021 Dependencies The code was implemented in Python 3.6 and the following packages are needed for running it: gym==0.17.2 numpy==1.18.2 pandas==1.0.4 tensorflow==1.15.0 torch==1.6.0 tqdm==4.32.1 scipy==1.1.0 scikit-learn==0.22.2 stable-baselines==2.10.1 Running and evaluating the model: The control tasks used for experiments are from OpenAI gym [1]. Each control task is associated with a true rewardfunction (unknown to the imitation algorithm). In each case, the “expert” demonstrator can be obtained by […]
Read more