Articles About Machine Learning

Encoder-Decoder Long Short-Term Memory Networks

Last Updated on August 14, 2019 Gentle introduction to the Encoder-Decoder LSTMs forsequence-to-sequence prediction with example Python code. The Encoder-Decoder LSTM is a recurrent neural network designed to address sequence-to-sequence problems, sometimes called seq2seq. Sequence-to-sequence prediction problems are challenging because the number of items in the input and output sequences can vary. For example, text translation and learning to execute programs are examples of seq2seq problems. In this post, you will discover the Encoder-Decoder LSTM architecture for sequence-to-sequence prediction. After […]

Read more

Gentle Introduction to Generative Long Short-Term Memory Networks

Last Updated on August 14, 2019 The Long Short-Term Memory recurrent neural network was developed for sequence prediction. In addition to sequence prediction problems. LSTMs can also be used as a generative model In this post, you will discover how LSTMs can be used as generative models. After completing this post, you will know: About generative models, with a focus on generative models for text called language modeling. Examples of applications where LSTM Generative models have been used. Examples of […]

Read more

How to Make Predictions with Long Short-Term Memory Models in Keras

Last Updated on August 14, 2019 The goal of developing an LSTM model is a final model that you can use on your sequence prediction problem. In this post, you will discover how to finalize your model and use it to make predictions on new data. After completing this post, you will know: How to train a final LSTM model. How to save your final LSTM model, and later load it again. How to make predictions on new data. Kick-start […]

Read more

How to Reshape Input Data for Long Short-Term Memory Networks in Keras

Last Updated on August 14, 2019 It can be difficult to understand how to prepare your sequence data for input to an LSTM model. Often there is confusion around how to define the input layer for the LSTM model. There is also confusion about how to convert your sequence data that may be a 1D or 2D matrix of numbers to the required 3D format of the LSTM input layer. In this tutorial, you will discover how to define the […]

Read more

How to Diagnose Overfitting and Underfitting of LSTM Models

Last Updated on January 8, 2020 It can be difficult to determine whether your Long Short-Term Memory model is performing well on your sequence prediction problem. You may be getting a good model skill score, but it is important to know whether your model is a good fit for your data or if it is underfit or overfit and could do better with a different configuration. In this tutorial, you will discover how you can diagnose the fit of your […]

Read more

Making Predictions with Sequences

Last Updated on August 14, 2019 Sequence prediction is different from other types of supervised learning problems. The sequence imposes an order on the observations that must be preserved when training models and making predictions. Generally, prediction problems that involve sequence data are referred to as sequence prediction problems, although there are a suite of problems that differ based on the input and output sequences. In this tutorial, you will discover the different types of sequence prediction problems. After completing […]

Read more

A Gentle Introduction to RNN Unrolling

Last Updated on August 14, 2019 Recurrent neural networks are a type of neural network where the outputs from previous time steps are fed as input to the current time step. This creates a network graph or circuit diagram with cycles, which can make it difficult to understand how information moves through the network. In this post, you will discover the concept of unrolling or unfolding recurrent neural networks. After reading this post, you will know: The standard conception of […]

Read more

Top Books on Natural Language Processing

Last Updated on August 14, 2020 Natural Language Processing, or NLP for short, is the study of computational methods for working with speech and text data. The field is dominated by the statistical paradigm and machine learning methods are used for developing predictive models. In this post, you will discover the top books that you can read to get started with natural language processing. After reading this post, you will know: The top books for practical natural language processing. The […]

Read more

Review of Stanford Course on Deep Learning for Natural Language Processing

Last Updated on August 7, 2019 Natural Language Processing, or NLP, is a subfield of machine learning concerned with understanding speech and text data. Statistical methods and statistical machine learning dominate the field and more recently deep learning methods have proven very effective in challenging NLP problems like speech recognition and text translation. In this post, you will discover the Stanford course on the topic of Natural Language Processing with Deep Learning methods. This course is free and I encourage […]

Read more

Oxford Course on Deep Learning for Natural Language Processing

Last Updated on August 7, 2019 Deep Learning methods achieve state-of-the-art results on a suite of natural language processing problems What makes this exciting is that single models are trained end-to-end, replacing a suite of specialized statistical models. The University of Oxford in the UK teaches a course on Deep Learning for Natural Language Processing and much of the materials for this course are available online for free. In this post, you will discover the Oxford course on Deep Learning […]

Read more
1 165 166 167 168 169 226