So, You are Working on a Machine Learning Problem…

Last Updated on January 9, 2019 So, you’re working on a machine learning problem. I want to really nail down where you’re at right now. Let me make some guesses… So, You are Working on a Machine Learning Problem…Photo by David Mulder, some rights reserved. 1) You Have a Problem So you have a problem that you need to solve. Maybe it’s your problem, an idea you have, a question, or something you want to address. Or maybe it is […]

Read more

How to Make Predictions with scikit-learn

Last Updated on January 10, 2020 How to predict classification or regression outcomeswith scikit-learn models in Python. Once you choose and fit a final machine learning model in scikit-learn, you can use it to make predictions on new data instances. There is some confusion amongst beginners about how exactly to do this. I often see questions such as: How do I make predictions with my model in scikit-learn? In this tutorial, you will discover exactly how you can make classification […]

Read more

How to Make Predictions with Keras

Last Updated on August 27, 2020 Once you choose and fit a final deep learning model in Keras, you can use it to make predictions on new data instances. There is some confusion amongst beginners about how exactly to do this. I often see questions such as: How do I make predictions with my model in Keras? In this tutorial, you will discover exactly how you can make classification and regression predictions with a finalized deep learning model with the […]

Read more

Machine Learning Development Environment

The development environment that you use for machine learning may be just as important as the machine learning methods that you use to solve your predictive modeling problem. A few times a week, I get a question such as: What is your development environment for machine learning? In this post, you will discover the development environment that I use and recommend for applied machine learning for developers. After reading this post, you will know: The important distinctions between the role […]

Read more

Analytical vs Numerical Solutions in Machine Learning

Do you have questions like: What data is best for my problem? What algorithm is best for my data? How do I best configure my algorithm? Why can’t a machine learning expert just give you a straight answer to your question? In this post, I want to help you see why no one can ever tell you what algorithm to use or how to configure it for your specific dataset. I want to help you see that finding good data/algorithm/configuration […]

Read more

How to Get the Most From Your Machine Learning Data

Last Updated on June 30, 2020 The data that you use, and how you use it, will likely define the success of your predictive modeling problem. Data and the framing of your problem may be the point of biggest leverage on your project. Choosing the wrong data or the wrong framing for your problem may lead to a model with poor performance or, at worst, a model that cannot converge. It is not possible to analytically calculate what data to […]

Read more

The Model Performance Mismatch Problem (and what to do about it)

What To Do If Model Test Results Are Worse than Training. The procedure when evaluating machine learning models is to fit and evaluate them on training data, then verify that the model has good skill on a held-back test dataset. Often, you will get a very promising performance when evaluating the model on the training dataset and poor performance when evaluating the model on the test set. In this post, you will discover techniques and issues to consider when you […]

Read more

How To Know if Your Machine Learning Model Has Good Performance

After you develop a machine learning model for your predictive modeling problem, how do you know if the performance of the model is any good? This is a common question I am asked by beginners. As a beginner, you often seek an answer to this question, e.g. you want someone to tell you whether an accuracy of x% or an error score of x is good or not. In this post, you will discover how to answer this question for […]

Read more

Introduction to Random Number Generators for Machine Learning in Python

Last Updated on July 31, 2020 Randomness is a big part of machine learning. Randomness is used as a tool or a feature in preparing data and in learning algorithms that map input data to output data in order to make predictions. In order to understand the need for statistical methods in machine learning, you must understand the source of randomness in machine learning. The source of randomness in machine learning is a mathematical trick called a pseudorandom number generator. […]

Read more

How to Remove Outliers for Machine Learning

Last Updated on August 18, 2020 When modeling, it is important to clean the data sample to ensure that the observations best represent the problem. Sometimes a dataset can contain extreme values that are outside the range of what is expected and unlike the other data. These are called outliers and often machine learning modeling and model skill in general can be improved by understanding and even removing these outlier values. In this tutorial, you will discover outliers and how […]

Read more
1 818 819 820 821 822 910