A Brief Introduction to BERT
data:image/s3,"s3://crabby-images/ea7a9/ea7a9b4e6c3b5143cc07fa78506b7618a81908b2" alt=""
As we learned what a Transformer is and how we might train the Transformer model, we notice that it is a great tool to make a computer understand human language. However, the Transformer was originally designed as a model to translate one language to another. If we repurpose it for a different task, we would likely need to retrain the whole model from scratch. Given the time it takes to train a Transformer model is enormous, we would like to have a solution that enables us to readily reuse the trained Transformer for many different tasks. BERT is such a model. It is an extension of the encoder part of a Transformer.
In this tutorial, you will