A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner,
M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, N. Houlsby, 2020, An
Image is Worth 16x16 Words: Transformers for Image Recognition at Scale, arXiv preprint
arXiv:2010.11929