What is a Confusion Matrix in Machine Learning

Last Updated on August 15, 2020 Make the Confusion Matrix Less Confusing. A confusion matrix is a technique for summarizing the performance of a classification algorithm. Classification accuracy alone can be misleading if you have an unequal number of observations in each class or if you have more than two classes in your dataset. Calculating a confusion matrix can give you a better idea of what your classification model is getting right and what types of errors it is making. […]

Read more

Top Books on Time Series Forecasting With R

Last Updated on August 15, 2020 Time series forecasting is a difficult problem. Unlike classification and regression, time series data also adds a time dimension which imposes an ordering of observations. This turns rows into a sequence which requires careful and specific handling. In this post, you will discover the top books for time series analysis and forecasting in R. These books will provide the resources that you need to get started working through your own time series predictive modeling […]

Read more

Machine Learning Performance Improvement Cheat Sheet

Last Updated on May 22, 2019 32 Tips, Tricks and Hacks That You Can Use To Make Better Predictions. The most valuable part of machine learning is predictive modeling. This is the development of models that are trained on historical data and make predictions on new data. And the number one question when it comes to predictive modeling is: How can I get better results? This cheat sheet contains my best advice distilled from years of my own application and […]

Read more

10 Standard Datasets for Practicing Applied Machine Learning

Last Updated on May 20, 2020 The key to getting good at applied machine learning is practicing on lots of different datasets. This is because each problem is different, requiring subtly different data preparation and modeling methods. In this post, you will discover 10 top standard machine learning datasets that you can use for practice. Let’s dive in. Update Mar/2018: Added alternate link to download the Pima Indians and Boston Housing datasets as the originals appear to have been taken […]

Read more

5 Top Machine Learning Podcasts

Machine learning podcasts are now a thing. There are now enough of us interested in this obscure geeky topic that there are podcasts dedicated to chatting about the ins and outs of predictive modeling. There has never been a better time to get started and working in this amazing field. In this post, I want to share the 5 podcasts on machine learning and data science that I listen to. Let’s dive in. Overview Here’s the short list of machine […]

Read more

7 Time Series Datasets for Machine Learning

Last Updated on August 21, 2019 Machine learning can be applied to time series datasets. These are problems where a numeric or categorical value must be predicted, but the rows of data are ordered by time. A problem when getting started in time series forecasting with machine learning is finding good quality standard datasets on which to practice. In this post, you will discover 8 standard time series datasets that you can use to get started and practice time series […]

Read more

What Is Time Series Forecasting?

Last Updated on August 15, 2020 Time series forecasting is an important area of machine learning that is often neglected. It is important because there are so many prediction problems that involve a time component. These problems are neglected because it is this time component that makes time series problems more difficult to handle. In this post, you will discover time series forecasting. After reading this post, you will know: Standard definitions of time series, time series analysis, and time […]

Read more

Time Series Forecasting as Supervised Learning

Last Updated on August 15, 2020 Time series forecasting can be framed as a supervised learning problem. This re-framing of your time series data allows you access to the suite of standard linear and nonlinear machine learning algorithms on your problem. In this post, you will discover how you can re-frame your time series problem as a supervised learning problem for machine learning. After reading this post, you will know: What supervised learning is and how it is the foundation […]

Read more

How to Go From Working in a Bank To Hired as Senior Data Scientist at Target

Last Updated on December 7, 2016 How Santhosh Sharma Went FromWorking in the Loans Department of a Bank toGetting Hired as a Senior Data Scientist at Target. Santhosh Sharma recently reached out to me to share his inspirational story and I want to share it with you. His story shows how with enthusiasm for machine learning, taking the initiative, sharing your results and a little luck can change your career and throw you deep into applied machine learning. After reading this interview, you will know: How Santhosh […]

Read more

How to Load and Explore Time Series Data in Python

Last Updated on April 30, 2020 The Pandas library in Python provides excellent, built-in support for time series data. Once loaded, Pandas also provides tools to explore and better understand your dataset. In this post, you will discover how to load and explore your time series dataset. After completing this tutorial, you will know: How to load your time series dataset from a CSV file using Pandas. How to peek at the loaded data and calculate summary statistics. How to […]

Read more
1 796 797 798 799 800 910