TY  - JOUR
AU  - Marçal Rusiñol
AU  - Volkmar Frinken
AU  - Dimosthenis Karatzas
AU  - Andrew Bagdanov
AU  - Josep Llados
PY  - 2014//
TI  - Multimodal page classification in administrative document image streams
T2  - IJDAR
JO  - International Journal on Document Analysis and Recognition
SP  - 331
EP  - 341
VL  - 17
IS  - 4
PB  - Springer Berlin Heidelberg
KW  - Digital mail room
KW  - Multimodal page classification
KW  - Visual and textual document description
N2  - In this paper, we present a page classification application in a banking workflow. The proposed architecture represents administrative document images by merging visual and textual descriptions. The visual description is based on a hierarchical representation of the pixel intensity distribution. The textual description uses latent semantic analysis to represent document content as a mixture of topics. Several off-the-shelf classifiers and different strategies for combining visual and textual cues have been evaluated. A final step uses an n-gram model of the page stream allowing a finer-grained classification of pages. The proposed method has been tested in a real large-scale environment and we report results on a dataset of 70,000 pages.
SN  - 1433-2833
UR  - http://dx.doi.org/10.1007/s10032-014-0225-8
N1  - DAG; LAMP; 600.056; 600.061; 601.240; 601.223; 600.077; 600.079
ID  - Marçal Rusiñol2014
ER  -