This covers my work on the fourth lesson of the Fast AI course.
The model used was initially trained on the Wikipedia test set (WT103). Very interesting to hear that what was state of the art last year, has been bundled into the base FastAI library this year. Fast-moving field…
Steps involved:
Obviously, one needs to stick to the same language between steps 1 and 2 above. Would be interesting to see how much the model fails if initially trained in English, then applied to Korean for example.
Interesting coverage of other types of data such as tabular (aka spreadsheet/database) and collaborative filtering as well.