Machine Learning Q&A: Concept Drift, Better Results and Learning Faster
Last Updated on June 7, 2016
I get a lot of questions about machine learning via email and I love answering them.
I get to see what real people are doing and help to make a difference. (Do you have a question about machine learning? Contact me).
In this post I highlight a few of the interesting questions I have received recently and summarize my answers.
Why does my spam classifier get worse when I train it on all lots of old emails?
This is a great question as it highlights an important concept in machine learning called concept drift.
The content of emails change through time. The user will change who they converse with and on which topics. Email spammers will send different offers and will actively change their tactics within emails to avoid email spam detection.
These changes affect the modeling.
The best source of information about which emails are spam and which are not spam are the emails
To finish reading, please visit source site