%0 Conference Proceedings %T Towards Query-by-Speech Handwritten Keyword Spotting %A Marçal Rusiñol %A David Aldavert %A Ricardo Toledo %A Josep Llados %B 13th International Conference on Document Analysis and Recognition ICDAR2015 %D 2015 %F Marçal Rusiñol2015 %O DAG; 600.084; 600.061; 601.223; 600.077;ADAS %O exported from refbase (http://refbase.cvc.uab.es/show.php?record=2682), last updated on Tue, 18 Oct 2016 17:47:14 +0200 %X In this paper, we present a new querying paradigm for handwritten keyword spotting. We propose to represent handwritten word images both by visual and audio representations, enabling a query-by-speech keyword spotting system. The two representations are merged together and projected to a common sub-space in the training phase. This transform allows to, given a spoken query, retrieve word instances that were only represented by the visual modality. In addition, the same method can be used backwards at no additional cost to produce a handwritten text-tospeech system. We present our first results on this new querying mechanism using synthetic voices over the George Washingtondataset. %U http://refbase.cvc.uab.es/files/RAT2015b.pdf %U http://dx.doi.org/10.1109/ICDAR.2015.7333812 %P 501-505