How to Train XGBoost Models in the Cloud with Amazon Web Services

Last Updated on August 27, 2020 The XGBoost library provides an implementation of gradient boosting designed for speed and performance. It is implemented to make best use of your computing resources, including all CPU cores and memory. In this post you will discover how you can setup a server on Amazon’s cloud service to quickly and cheaply create very large models. After reading this post you will know: How to setup and configure an Amazon EC2 server instance for use with […]

Read more

What You Know About Deep Learning Is A Lie

Last Updated on August 19, 2019 Getting started in deep learning is a struggle. It’s a struggle because deep learning is taught by academics, for academics. If you’re a developer (or practitioner), you’re different. You want results. The way practitioners learn new technologies is by developing prototypes that deliver value quickly. This is a top-down approach to learning, but it is not the way that deep learning is taught. There is another way. A way that works for top-down practitioners like you. In this […]

Read more

Tune Learning Rate for Gradient Boosting with XGBoost in Python

Last Updated on August 27, 2020 A problem with gradient boosted decision trees is that they are quick to learn and overfit training data. One effective way to slow down learning in the gradient boosting model is to use a learning rate, also called shrinkage (or eta in XGBoost documentation). In this post you will discover the effect of the learning rate in gradient boosting and how to tune it on your machine learning problem using the XGBoost library in […]

Read more

Stochastic Gradient Boosting with XGBoost and scikit-learn in Python

Last Updated on August 27, 2020 A simple technique for ensembling decision trees involves training trees on subsamples of the training dataset. Subsets of the the rows in the training data can be taken to train individual trees called bagging. When subsets of rows of the training data are also taken when calculating each split point, this is called random forest. These techniques can also be used in the gradient tree boosting model in a technique called stochastic gradient boosting. […]

Read more

How To Improve Deep Learning Performance

Last Updated on August 6, 2019 20 Tips, Tricks and Techniques That You Can Use ToFight Overfitting and Get Better Generalization How can you get better performance from your deep learning model? It is one of the most common questions I get asked. It might be asked as: How can I improve accuracy? …or it may be reversed as: What can I do if my neural network performs poorly? I often reply with “I don’t know exactly, but I have […]

Read more

7 Step Mini-Course to Get Started with XGBoost in Python

Last Updated on April 24, 2020 XGBoost With Python Mini-Course. XGBoost is an implementation of gradient boosting that is being used to win machine learning competitions. It is powerful but it can be hard to get started. In this post, you will discover a 7-part crash course on XGBoost with Python. This mini-course is designed for Python machine learning practitioners that are already comfortable with scikit-learn and the SciPy ecosystem. Kick-start your project with my new book XGBoost With Python, […]

Read more

Python Machine Learning Mini-Course

Last Updated on August 3, 2020 From Developer to Machine Learning Practitioner in 14 Days Python is one of the fastest-growing platforms for applied machine learning. In this mini-course, you will discover how you can get started, build accurate models and confidently complete predictive modeling machine learning projects using Python in 14 days. This is a big and important post. You might want to bookmark it. Kick-start your project with my new book Machine Learning Mastery With Python, including step-by-step […]

Read more

Embrace Randomness in Machine Learning

Last Updated on August 12, 2019 Why Do You Get Different Results On Different Runs Of An Algorithm With The Same Data? Applied machine learning is a tapestry of breakthroughs and mindset shifts. Understanding the role of randomness in machine learning algorithms is one of those breakthroughs. Once you get it, you will see things differently. In a whole new light. Things like choosing between one algorithm and another, hyperparameter tuning and reporting results. You will also start to see the abuses […]

Read more

Deploy Your Predictive Model To Production

Last Updated on September 30, 2016 5 Best Practices For Operationalizing Machine Learning. Not all predictive models are at Google-scale. Sometimes you develop a small predictive model that you want to put in your software. I recently received this reader question: Actually, there is a part that is missing in my knowledge about machine learning. All tutorials give you the steps up until you build your machine learning model. How could you use this model? In this post, we look at […]

Read more

How Beginners Get It Wrong In Machine Learning

Last Updated on October 3, 2016 The 5 Most Common Mistakes That Beginners MakeAnd How To Avoid Them. I help beginners get started in machine learning. But I see the same mistakes in both mindset and action again and again. In this post, you will discover the 5 most common ways that I see beginners slip-up when getting started in machine learning. I firmly believe thatanyone can get started and do really wellwith applied machine learning. Hopefully, you can identify yourself […]

Read more
1 793 794 795 796 797 910