How to Prepare Text Data for Deep Learning with Keras
Last Updated on August 7, 2019
You cannot feed raw text directly into deep learning models.
Text data must be encoded as numbers to be used as input or output for machine learning and deep learning models.
The Keras deep learning library provides some basic tools to help you prepare your text data.
In this tutorial, you will discover how you can use Keras to prepare your text data.
After completing this tutorial, you will know:
- About the convenience methods that you can use to quickly prepare text data.
- The Tokenizer API that can be fit on training data and used to encode training, validation, and test documents.
- The range of 4 different document encoding schemes offered by the Tokenizer API.
Kick-start your project with my new book Deep Learning for Natural Language Processing, including step-by-step tutorials and the Python source code files for all examples.
Let’s get started.