A Gentle Introduction to Long Short-Term Memory Networks by the Experts

Last Updated on February 20, 2020 Long Short-Term Memory (LSTM) networks are a type of recurrent neural network capable of learning order dependence in sequence prediction problems. This is a behavior required in complex problem domains like machine translation, speech recognition, and more. LSTMs are a complex area of deep learning. It can be hard to get your hands around what LSTMs are, and how terms like bidirectional and sequence-to-sequence relate to the field. In this post, you will get […]

Read more

On the Suitability of Long Short-Term Memory Networks for Time Series Forecasting

Last Updated on August 5, 2019 Long Short-Term Memory (LSTM) is a type of recurrent neural network that can learn the order dependence between items in a sequence. LSTMs have the promise of being able to learn the context required to make predictions in time series forecasting problems, rather than having this context pre-specified and fixed. Given the promise, there is some doubt as to whether LSTMs are appropriate for time series forecasting. In this post, we will look at […]

Read more

7 Ways to Handle Large Data Files for Machine Learning

Exploring and applying machine learning algorithms to datasets that are too large to fit into memory is pretty common. This leads to questions like: How do I load my multiple gigabyte data file? Algorithms crash when I try to run my dataset; what should I do? Can you help me with out-of-memory errors? In this post, I want to offer some common suggestions you may want to consider. 7 Ways to Handle Large Data Files for Machine LearningPhoto by Gareth […]

Read more

How to Evaluate the Skill of Deep Learning Models

Last Updated on August 14, 2020 I often see practitioners expressing confusion about how to evaluate a deep learning model. This is often obvious from questions like: What random seed should I use? Do I need a random seed? Why don’t I get the same results on subsequent runs? In this post, you will discover the procedure that you can use to evaluate deep learning models and the rationale for using it. You will also discover useful related statistics that […]

Read more

How to Report Classifier Performance with Confidence Intervals

Last Updated on August 14, 2020 Once you choose a machine learning algorithm for your classification problem, you need to report the performance of the model to stakeholders. This is important so that you can set the expectations for the model on new data. A common mistake is to report the classification accuracy of the model alone. In this post, you will discover how to calculate confidence intervals on the performance of your model to provide a calibrated and robust […]

Read more

How to Calculate Bootstrap Confidence Intervals For Machine Learning Results in Python

Last Updated on August 14, 2020 It is important to both present the expected skill of a machine learning model a well as confidence intervals for that model skill. Confidence intervals provide a range of model skills and a likelihood that the model skill will fall between the ranges when making predictions on new data. For example, a 95% likelihood of classification accuracy between 70% and 75%. A robust way to calculate confidence intervals for machine learning algorithms is to […]

Read more

The 5 Step Life-Cycle for Long Short-Term Memory Models in Keras

Last Updated on August 27, 2020 Deep learning neural networks are very easy to create and evaluate in Python with Keras, but you must follow a strict model life-cycle. In this post, you will discover the step-by-step life-cycle for creating, training, and evaluating Long Short-Term Memory (LSTM) Recurrent Neural Networks in Keras and how to make predictions with a trained model. After reading this post, you will know: How to define, compile, fit, and evaluate an LSTM in Keras. How […]

Read more

How to Learn to Echo Random Integers with LSTMs in Keras

Last Updated on August 27, 2020 Long Short-Term Memory (LSTM) Recurrent Neural Networks are able to learn the order dependence in long sequence data. They are a fundamental technique used in a range of state-of-the-art results, such as image captioning and machine translation. They can also be difficult to understand, specifically how to frame a problem to get the most out of this type of network. In this tutorial, you will discover how to develop a simple LSTM recurrent neural […]

Read more

How to use an Encoder-Decoder LSTM to Echo Sequences of Random Integers

Last Updated on August 27, 2020 A powerful feature of Long Short-Term Memory (LSTM) recurrent neural networks is that they can remember observations over long sequence intervals. This can be demonstrated by contriving a simple sequence echo problem where the entire input sequence or partial contiguous blocks of the input sequence are echoed as an output sequence. Developing LSTM recurrent neural networks to address the sequence echo problem is both a good demonstration of the power of LSTMs and can […]

Read more

How to Get Reproducible Results with Keras

Last Updated on August 19, 2019 Neural network algorithms are stochastic. This means they make use of randomness, such as initializing to random weights, and in turn the same network trained on the same data can produce different results. This can be confusing to beginners as the algorithm appears unstable, and in fact they are by design. The random initialization allows the network to learn a good approximation for the function being learned. Nevertheless, there are times when you need […]

Read more
1 804 805 806 807 808 910