Training a deep learning text classifier with a standalone text dataset
In the Training a deep learning language model with a standalone text dataset section, we trained a language model using the standalone text dataset: the Kaggle Coronavirus tweets NLP – Text Classification dataset described here: https://www.kaggle.com/datatattle/covid-19-nlp-text-classification. In this section, we will use this language model to create a text classifier trained with the Covid-related tweets dataset.
Getting ready
This recipe makes use of the encoder that you trained in the Training a deep learning language model with a standalone text dataset section, so ensure that you have followed the steps in the recipe in that section. In particular, ensure that you have saved the encoder from the language model you trained in the previous section.
Also, make sure you have followed all the steps from the Getting ready sub-section of the previous section to ensure the following:
- ...