Below are my notes on Andrej Karpathy’s video tutorial on introduction to language modeling. You can watch Andrej’s original presentation on youtube.
In Part 1, we worked on a bigram model that takes into account only the local context of a word. This approach is
To finish reading, please visit source site