Pointer cache for Language Model
Posted on Thu 26 April 2018 in Experiments • Tagged with Deep Learning, NLP
You can easily boost the performance of a language model based on RNNs by adding a pointer cache on top of it. The idea was introduce by Grave et al. and their results showed how this simple technique can make your perplexity decrease by 10 points without additional training. This sounds exciting, so let's see what this is all about and implement that in pytorch with the fastai library.
Continue reading