PT Unknown
AU Marçal Rusiñol
   David Aldavert
   Ricardo Toledo
   Josep Llados
TI Towards Query-by-Speech Handwritten Keyword Spotting
BT 13th International Conference on Document Analysis and Recognition ICDAR2015
PY 2015
BP 501
EP 505
DI 10.1109/ICDAR.2015.7333812
AB In this paper, we present a new querying paradigm for handwritten keyword spotting. We propose to represent handwritten word images both by visual and audio representations, enabling a query-by-speech keyword spotting system. The two representations are merged together and projected to a common sub-space in the training phase. This transform allows to, given a spoken query, retrieve word instances that were only represented by the visual modality. In addition, the same method can be used backwards at no additional cost to produce a handwritten text-tospeech system. We present our first results on this new querying mechanism using synthetic voices over the George Washingtondataset.
ER