%0 Conference Proceedings
%T The diagonal split: A pre-segmentation step for page layout analysis & classification
%A Albert Gordo
%A Ernest Valveny
%B 4th Iberian Conference on Pattern Recognition and Image Analysis
%D 2009
%V 5524
%I Springer Berlin Heidelberg
%@ 0302-9743
%@ 978-3-642-02171-8
%F Albert Gordo2009
%O DAG
%O exported from refbase (http://refbase.cvc.uab.es/show.php?record=1176), last updated on Tue, 17 Dec 2013 15:58:16 +0100
%X Document classification is an important task in all the processes related to document storage and retrieval. In the case of complex documents, structural features are needed to achieve a correct classification. Unfortunately, physical layout analysis is error prone. In this paper we present a pre-segmentation step based on a divide & conquer strategy that can be used to improve the page segmentation results, independently of the segmentation algorithm used. This pre-segmentation step is evaluated in classification and retrieval using the selective CRLA algorithm for layout segmentation together with a clustering based on the Voronoi area diagram, and tested on two different databases, MARG and Girona Archives.
%U http://dx.doi.org/10.1007/978-3-642-02172-5_38
%P 290–297