TY - JOUR AU - Palaiahnakote Shivakumara AU - Anjan Dutta AU - Chew Lim Tan AU - Umapada Pal PY - 2014// TI - Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing T2 - MTAP JO - Multimedia Tools and Applications SP - 515 EP - 539 VL - 72 IS - 1 PB - Springer US N2 - In this paper, we address two complex issues: 1) Text frame classification and 2) Multi-oriented text detection in video text frame. We first divide a video frame into 16 blocks and propose a combination of wavelet and median-moments with k-means clustering at the block level to identify probable text blocks. For each probable text block, the method applies the same combination of feature with k-means clustering over a sliding window running through the blocks to identify potential text candidates. We introduce a new idea of symmetry on text candidates in each block based on the observation that pixel distribution in text exhibits a symmetric pattern. The method integrates all blocks containing text candidates in the frame and then all text candidates are mapped on to a Sobel edge map of the original frame to obtain text representatives. To tackle the multi-orientation problem, we present a new method called Angle Projection Boundary Growing (APBG) which is an iterative algorithm and works based on a nearest neighbor concept. APBG is then applied on the text representatives to fix the bounding box for multi-oriented text lines in the video frame. Directional information is used to eliminate false positives. Experimental results on a variety of datasets such as non-horizontal, horizontal, publicly available data (Hua’s data) and ICDAR-03 competition data (camera images) show that the proposed method outperforms existing methods proposed for video and the state of the art methods for scene text as well. SN - 1380-7501 UR - http://dx.doi.org/10.1007/s11042-013-1385-0 N1 - DAG; 600.077 ID - Palaiahnakote Shivakumara2014 ER -