8 Tactics to Combat Imbalanced Classes in Your Machine Learning Dataset
Last Updated on August 15, 2020
Has this happened to you?
You are working on your dataset. You create a classification model and get 90% accuracy immediately. “Fantastic” you think. You dive a little deeper and discover that 90% of the data belongs to one class. Damn!
This is an example of an imbalanced dataset and the frustrating results it can cause.
In this post you will discover the tactics that you can use to deliver great results on machine learning datasets with imbalanced data.
Kick-start your project with my new book Imbalanced Classification with Python, including step-by-step tutorials and the Python source code files for all examples.
Let’s get started.
Coming To Grips With Imbalanced Data
I get emails about class imbalance all the time, for example:
I have a binary classification problem and one class is present with 60:1 ratio in my training set. I used the logistic regression and the result seems to just ignores one class.