Preliminary code for Representation learning with Generalized Similarity Functions
Code for GSF learning in offline Procgen. Note: The repo is under construction right now, some experiments might still be changed/ added. Since the dataset is very large due to operating on pixel observations, we provide a way to generate it from pre-trained PPO checkpoints instead of hosting 1Tb+ of data. Instructions Clone the repo Either train a PPO agent from scratch on 200 levels (see here: here), or download provided PPO checkpoints (same repo link). TLDR, you can run […]
Read more