Unofficial code reproducing Agent57

This repository contains unofficial code reproducing Agent57, which outperformed humans in all Atari games.

agent.py

define agent to play a supecific environment.
buffer.py

define buffer to store experiences with priorites.
learner.py

define learner to update parameter such as q networks and functions related to intrinsic reward.
main.py

run the main pipeline.
model.py

define some models such as q network and functions related to intrinsic reward.
segment_tree.py

define segment tree which decide segment index according to the priority.
tester.py

define tester which test performance of Agent57.
utils.py

define some classes and functions such as UCB and Retrace operator.