Articles About Machine Learning

Design and Run your First Experiment in Weka

Last Updated on August 22, 2019 Weka is the perfect platform for learning machine learning. It provides a graphical user interface for exploring and experimenting with machine learning algorithms on datasets, without you having to worry about the mathematics or the programming. A powerful feature of Weka is the Weka Experimenter interface. Unlike the Weka Explorer that is for filtering data and trying out different algorithms, the Experimenter is for designing and running experiments. The experimental results it produces are […]

Read more

Quick and Dirty Data Analysis for your Machine Learning Problem

Last Updated on August 22, 2019 A part of having a good understanding of the machine learning problem that you’re working on, you need to know the data intimately. I personally find this step onerous sometimes and just want to get on with defining my test harness, but I know it always flushes out interested ideas and assumptions to test. As such, I use a step-by-step process to capture a minimum number of observations about the actual dataset before moving […]

Read more

How to Tune a Machine Learning Algorithm in Weka

Last Updated on August 22, 2019 Weka is the perfect platform for learning machine learning. It provides a graphical user interface for exploring and experimenting with machine learning algorithms on datasets, without you having to worry about the mathematics or the programming. In a previous post we looked at how to design and run an experiment with 3 algorithms on a dataset and how to analyse and report the results. Manhattan Skyline, because we are going to be looking at […]

Read more

Make Better Predictions with Boosting, Bagging and Blending Ensembles in Weka

Last Updated on August 22, 2019 Weka is the perfect platform for studying machine learning. It provides a graphical user interface for exploring and experimenting with machine learning algorithms on datasets, without you having to worry about the mathematics or the programming. In a previous post we looked at how to design and run an experiment running 3 algorithms on a dataset and how to analyse and report the results. We also looked at how to design and run an […]

Read more

4-Steps to Get Started in Applied Machine Learning

Last Updated on August 16, 2020 A Top-Down Strategy for Beginners to Start and Practice Machine Learning. Getting started is much easier than you think. In this post I show you the top-down approach for getting started in applied machine learning. You will discover the four steps to this approach. They should feel familiar because it’s probably the same top-down approach that you used to learn how to program. Namely, get the basics, practice a lot and dive into the […]

Read more

A Simple Intuition for Overfitting, or Why Testing on Training Data is a Bad Idea

Last Updated on August 21, 2016 When you first start out with machine learning you load a dataset and try models. You might think to yourself, why can’t I just build a model with all of the data and evaluate it on the same dataset? It seems reasonable. More data to train the model is better, right? Evaluating the model and reporting results on the same dataset will tell you how good the model is, right? Wrong. In this post […]

Read more

Template for Working through Machine Learning Problems in Weka

Last Updated on August 22, 2019 When you are getting started in Weka, you may feel overwhelmed. There are so many datasets, so many filters and so many algorithms to choose from. There is too much choice. There are too many things you could be doing. Too much ChoicePhoto by emilio labrador, some rights reserved. Structured process is key. I have talked about process and the need for tasks like spot checking algorithms to overcome the overwhelm and start learning […]

Read more

Biggest Mistake I Made When Starting Machine Learning, And How To Avoid It

Last Updated on August 22, 2019 When I first got started in machine learning I implemented algorithms by hand. It was really slow going. I was a terrible programmer at the time. I was trying to figure out the algorithms from books, how to use them on problems and how to write code – all at the same time. This was the biggest mistake I made when getting started. It made everything 3-times harder and killed my motivation. A friend […]

Read more

Feature Selection to Improve Accuracy and Decrease Training Time

Last Updated on August 16, 2020 Working on a problem, you are always looking to get the most out of the data that you have available. You want the best accuracy you can get. Typically, the biggest wins are in better understanding the problem you are solving. This is why I stress you spend so much time up front defining your problem, analyzing the data, and preparing datasets for your models. A key part of data preparation is creating transforms […]

Read more

Project Spotlight: Stack Exchange Clustering using Mahout with Konstantin Slisenko

Last Updated on August 16, 2020 This is a project spotlight with Konstantin Slisenko a programmer and machine learning enthusiast. Could you please introduce yourself? My name is Konstantin Slisenko, I’m from Belarus. I graduated from the Belarusian State University of Informatics and Radioelectronics. I am currently taking a master course. Konstantin Slisenko I’m a Java developer and work in JazzTeam company. I like to learn new technologies. I’m currently interested in big data and machine learning. I like to participate […]

Read more
1 122 123 124 125 126 226