References
Conneau, A. et al. (2019). Unsupervised Cross-lingual Representation Learning at Scale, Cornell University, https://arxiv.org/abs/1911.02116.
Goyal, N. et al. (2021). Larger-scale Transformers for Multilingual Masked Language mModeling, https://arxiv.org/abs/2105.00572.
Hu, J. et al. (2020). "XTREME: A Massively Multilingual Multi-task Benchmark for Evaluating Cross-lingual Generalisation", Proceedings of Machine Learning Research, Vol. 119, pp. 4411-4421, https://proceedings.mlr.press/v119/hu20b.html.
Joulin, A. et al. (2016). Bag of Tricks for Efficient Text Classification, Cornell University, https://arxiv.org/abs/1607.01759.
Mikolov, T., et al. (2017). Advances in Pre-training Distributed Word Representations, Cornell University, https://arxiv.org/abs/1712.09405.
Toporkov, O., and Agerri, R. (2024), “On the Role of Morphological Information for Contextual Lemmatization”, Computational Linguistics, Vol. 50/1, pp. 157-191, https://doi.org/DOI:10.1162/coli_a_00497.