TY  - CONF
AU  - Raul Gomez
AU  - Jaume Gibert
AU  - Lluis Gomez
AU  - Dimosthenis Karatzas
A2  - WACV
PY  - 2020//
TI  - Exploring Hate Speech Detection in Multimodal Publications
BT  - IEEE Winter Conference on Applications of Computer Vision
N2  - In this work we target the problem of hate speech detection in multimodal publications formed by a text and an image. We gather and annotate a large scale dataset from Twitter, MMHS150K, and propose different models that jointly analyze textual and visual information for hate speech detection, comparing them with unimodal detection. We provide quantitative and qualitative results and analyze the challenges of the proposed task. We find that, even though images are useful for the hate speech detection task, current multimodal models cannot outperform models analyzing only text. We discuss why and open the field and the dataset for further research.
UR  - https://ieeexplore.ieee.org/document/9093414
L1  - http://refbase.cvc.uab.es/files/GGG2020a.pdf
UR  - http://dx.doi.org/10.1109/WACV45572.2020.9093414
N1  - DAG; 600.121; 600.129
ID  - Raul Gomez2020
ER  -