PT Unknown AU Albert Gordo Florent Perronnin TI A Bag-of-Pages Approach to Unordered Multi-Page Document Classification BT 20th International Conference on Pattern Recognition PY 2010 BP 1920–1923 DI 10.1109/ICPR.2010.473 AB We consider the problem of classifying documents containing multiple unordered pages. For this purpose, we propose a novel bag-of-pages document representation. To represent a document, one assigns every page to a prototype in a codebook of pages. This leads to a histogram representation which can then be fed to any discriminative classifier. We also consider several refinements over this initial approach. We show on two challenging datasets that the proposed approach significantly outperforms a baseline system. ER