Ali Furkan Biten _and 7 others_. 2019. Scene Text Visual Question Answering. _18th IEEE International Conference on Computer Vision_.4291–4301.