Machine Translation Weekly 81: Unsupervsied MT and Parallel Sentence Mining
This week I am going to briefly comment on a paper that uses unsupervised machine translation to improve unsupervised scoring for parallel data mining. The title of the paper is Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining, it has authors from Charles University and the University of the Basque Country and will appear at this year’s ACL student research workshop. The idea of the paper is quite simple. They took XLM, a BERT-like model that was trained for 100 […]
Read more