%0 Conference Proceedings
%T Towards Query-by-Speech Handwritten Keyword Spotting
%A Marçal Rusiñol
%A David Aldavert
%A Ricardo Toledo
%A Josep Llados
%B 13th International Conference on Document Analysis and Recognition ICDAR2015
%D 2015
%F Marçal Rusiñol2015
%O DAG; 600.084; 600.061; 601.223; 600.077; ADAS
%O exported from refbase (http://refbase.cvc.uab.es/show.php?record=2682), last updated on Tue, 18 Oct 2016 17:47:14 +0200
%X In this paper, we present a new querying paradigm for handwritten keyword spotting. We propose to represent handwritten word images by both visual and audio representations, enabling a query-by-speech keyword spotting system. The two representations are merged and projected onto a common sub-space in the training phase. Given a spoken query, this transform makes it possible to retrieve word instances that were represented only by the visual modality. In addition, the same method can be used in reverse at no additional cost to produce a handwritten text-to-speech system. We present our first results on this new querying mechanism using synthetic voices over the George Washington dataset.
%U http://refbase.cvc.uab.es/files/RAT2015b.pdf
%U http://dx.doi.org/10.1109/ICDAR.2015.7333812
%P 501-505