How To Investigate Machine Learning Algorithm Behavior

Last Updated on December 13, 2019 Machine learning algorithms are complex systems that require study to understand. Static descriptions of machine learning algorithms are a good starting point, but are insufficient to get a feeling for how the algorithm behaves. You need to see the algorithm in action. Experimenting on a running machine learning algorithms will allow you to build an intuition for the cause and effect relationship of the algorithm parameters with the results you can achieve on different […]

Read more

Don’t Start with Open-Source Code When Implementing Machine Learning Algorithms

Last Updated on August 12, 2019 Edward Raff is the author of the Java Machine Learning library called JSAT (which is an acronym for Java Statistical Analysis Tool). Edward has implemented many algorithms in creating this library and I recently reached out to him and asked what advice he could give to beginners implementing machine learning algorithms from scratch. In this post we take a look at tips on implementing machine learning algorithms based on Edwards advice. Kick-start your project with […]

Read more

Take Control By Creating Targeted Lists of Machine Learning Algorithms

Last Updated on August 12, 2019 Any book on machine learning will list and describe dozens of machine learning algorithms. Once you start using tools and libraries you will discover dozens more. This can really wear you down, if you think you need to know about every possible algorithm out there. A simple trick to tackle this feeling and take some control back is to make lists of machine learning algorithms. This ridiculously simple tactic can give you a lot […]

Read more

How To Get Baseline Results And Why They Matter

Last Updated on June 27, 2017 In my courses and guides, I teach the preparation of a baseline result before diving into spot checking algorithms. A student of mine recently asked: If a baseline is not calculated for a problem, will it make the results of other algorithms questionable? He went on to ask: If other algorithms do not give better accuracy than the baseline, what lesson should we take from it? Does it indicate that the data set does not […]

Read more

Hello World of Applied Machine Learning

Last Updated on September 5, 2016 It is easy to feel overwhelmed with the large numbers of machine learning algorithms. There are so many to choose from, it is hard to know where to start and what to try. The choice can be paralyzing. You need to get over this fear and start. There is no magic book or course that is going to tell you what algorithm to use and when. In fact, in practice you cannot know this […]

Read more

Machine Learning Q&A: Concept Drift, Better Results and Learning Faster

Last Updated on June 7, 2016 I get a lot of questions about machine learning via email and I love answering them. I get to see what real people are doing and help to make a difference. (Do you have a question about machine learning? Contact me). In this post I highlight a few of the interesting questions I have received recently and summarize my answers. Machine Learning Q&APhoto by Angelo Amboldi, some rights reserved Why does my spam classifier […]

Read more

Why Aren’t My Results As Good As I Thought? You’re Probably Overfitting

Last Updated on August 15, 2020 We all know the satisfaction of running an analysis and seeing the results come back the way we want them to: 80% accuracy; 85%; 90%? The temptation is strong just to turn to the Results section of the report we’re writing, and put the numbers in. But wait: as always, it’s not that straightforward. Succumbing to this particular temptation could undermine the impact of otherwise completely valid analysis. With most machine learning algorithms it’s […]

Read more

Crash Course in Statistics for Machine Learning

Last Updated on August 15, 2020 You do not need to know statistics before you can start learning and applying machine learning. You can start today. Nevertheless, knowing some statistics can be very helpful to understand the language used in machine learning. Knowing some statistics will eventually be required when you want to start making strong claims about your results. In this post you will discover a few key concepts from statistics that will give you the confidence you need […]

Read more

How to Become a Data Scientist

Last Updated on April 19, 2018 How do you become a data scientist? I think that really depends on where you are now and what you really want to do as a data scientist. Nevertheless, DataCamp posted an infographic recently that described 8 easy steps to becoming a data scientist. In this post I want to highlight and review DataCamp’s infographic. How to become a data scientist A portion of the infographic posted on the DataCamp blog What is a […]

Read more

Data Management Matters And Why You Need To Take It Seriously

Last Updated on March 5, 2020 We live in a world drowning in data. Internet tracking, stock market movement, genome sequencing technologies and their ilk all produce enormous amounts of data. Most of this data is someone else’s responsibility, generated by someone else, stored in someone else’s database, which is maintained and made available by… you guessed it… someone else. But. Whenever we carry out a machine learning project we are working with a small subset of the all the […]

Read more
1 775 776 777 778 779 911