%0 Conference Proceedings %T A Bag-of-Pages Approach to Unordered Multi-Page Document Classification %A Albert Gordo %A Florent Perronnin %B 20th International Conference on Pattern Recognition %D 2010 %@ 1051-4651 %@ 978-1-4244-7542-1 %F Albert Gordo2010 %O DAG %O exported from refbase (http://refbase.cvc.uab.es/show.php?record=1480), last updated on Wed, 19 Feb 2014 16:09:38 +0100 %X We consider the problem of classifying documents containing multiple unordered pages. For this purpose, we propose a novel bag-of-pages document representation. To represent a document, one assigns every page to a prototype in a codebook of pages. This leads to a histogram representation which can then be fed to any discriminative classifier. We also consider several refinements over this initial approach. We show on two challenging datasets that the proposed approach significantly outperforms a baseline system. %U http://dx.doi.org/10.1109/ICPR.2010.473 %P 1920–1923