August 6, 2021 Deep Learning

Rainbow: Combining Improvements in Deep Reinforcement Learning

Rainbow

Rainbow: Combining Improvements in Deep Reinforcement Learning [1].

Results and pretrained models can be found in the releases.

[x] DQN [2]
[x] Double DQN [3]
[x] Prioritised Experience Replay [4]
[x] Dueling Network Architecture [5]
[x] Multi-step Returns [6]
[x] Distributional RL [7]
[x] Noisy Nets [8]

Run the original Rainbow with the default arguments:

python main.py

Data-efficient Rainbow [9] can be run using the following options (note that the “unbounded” memory is implemented here in practice by manually setting the memory capacity to be the same as the maximum number of timesteps):

python main.py --target-update 2000

               --T-max 100000

               --learn-start 1600

               --memory-capacity 100000

               --replay-frequency 1

               --multi-step 20

               --architecture data-efficient

               --hidden-size
 
 

 
To finish reading, please visit source site


		
		
	

		Categories
Categories


	
		
			Search for:
			
		
		
	


		
		Recent Posts
		
											
					Using Python’s .__dict__ to Work With Attributes
									
											
					Research Focus: Week of April 7, 2025
									
											
					Checking for Membership Using Python’s “in” and “not in” Operators
									
											
					Python News Roundup: April 2025
									
											
					Real-world healthcare AI development and deployment—at scale
									
					

		
Tags
Attention
blogathon
Calculus
Command-line Tools
Data Preparation
data science
data visualization
Deep Learning
Deep Learning for Computer Vision
Deep Learning for Natural Language Processing
Deep Learning for Time Series
Deep Learning Performance
Deep Learning with PyTorch
Ensemble Learning
Generative Adversarial Networks
Imbalanced Classification
Linear Algebra
Long Short-Term Memory Networks
machine learning
Machine Learning Algorithms
Machine Learning Process
Machine Learning Resources
machine translation
Matplotlib
Natural language processing
Natural Language Processing & Speech
Neural MT
nlp
NMT
opencv
Optimization
pandas
Probability
python
Python for Machine Learning
Python Machine Learning
Resources
R Machine Learning
scikit-learn
sentiment analysis
Start Machine Learning
Statistics
Time Series
Weka Machine Learning
XGBoost
Categories
Categories

Archives
		Archives


	
	
		

	
	
				
		
		
			
				
								
				
					
	
		Powered by WordPress and Rubine.