%0 Conference Proceedings
%T Compact and Efficient Multitask Learning in Vision, Language and Speech
%A Mohammed Al Rawi
%A Ernest Valveny
%B IEEE International Conference on Computer Vision Workshops
%D 2019
%F Mohammed Al Rawi2019
%O DAG; 600.121; 600.129
%O exported from refbase (http://refbase.cvc.uab.es/show.php?record=3365), last updated on Fri, 26 Feb 2021 13:57:34 +0100
%X Across-domain multitask learning is a challenging area of computer vision and machine learning due to the intra-similarities among class distributions. Addressing this problem to cope with the human cognition system, by considering inter- and intra-class categorization and recognition, complicates the problem even further. In this work, we propose an effective holistic and hierarchical learning approach that uses a text embedding layer on top of a deep learning model. We also propose a novel sensory discriminator approach to resolve the collisions between different tasks and domains. We then train the model concurrently on textual sentiment analysis, speech recognition, image classification, action recognition from video, and handwriting word spotting of two different scripts (Arabic and English). The proposed model successfully learned different tasks across multiple domains.
%U https://ieeexplore.ieee.org/document/9022188
%U http://refbase.cvc.uab.es/files/RaV2019.pdf
%U http://dx.doi.org/10.1109/ICCVW.2019.00355
%P 2933-2942