PT Unknown AU Antonio Clavelli Dimosthenis Karatzas Josep Llados TI A framework for the assessment of text extraction algorithms on complex colour images BT 9th IAPR International Workshop on Document Analysis Systems PY 2010 BP 19–26 DI 10.1145/1815330.1815333 AB The availability of open, ground-truthed datasets and clear performance metrics is a crucial factor in the development of an application domain. The domain of colour text image analysis (real scenes, Web and spam images, scanned colour documents) has traditionally suffered from a lack of a comprehensive performance evaluation framework. Such a framework is extremely difficult to specify, and corresponding pixel-level accurate information tedious to define. In this paper we discuss the challenges and technical issues associated with developing such a framework. Then, we describe a complete framework for the evaluation of text extraction methods at multiple levels, provide a detailed ground-truth specification and present a case study on how this framework can be used in a real-life situation. ER