S. Chanda, Umapada Pal, & Oriol Ramos Terrades. (2009). Word-Wise Thai and Roman Script Identification. TALIP - ACM Transactions on Asian Language Information Processing, 1–21.
Abstract: In some Thai documents, a single text line of a printed document page may contain words of both Thai and Roman scripts. For the Optical Character Recognition (OCR) of such a document page it is better to identify, at first, Thai and Roman script portions and then to use individual OCR systems of the respective scripts on these identified portions. In this article, an SVM-based method is proposed for identification of word-wise printed Roman and Thai scripts from a single line of a document page. Here, at first, the document is segmented into lines and then lines are segmented into character groups (words). In the proposed scheme, we identify the script of a character group combining different character features obtained from structural shape, profile behavior, component overlapping information, topological properties, and water reservoir concept, etc. Based on the experiment on 10,000 data (words) we obtained 99.62% script identification accuracy from the proposed scheme.
|
Oriol Rodriguez-Leor, J. Mauri, Eduard Fernandez-Nofrerias, Antonio Tovar, Vicente del Valle, Aura Hernandez-Sabate, et al. (2004). Utilizacion de la estructura de los campos vectoriales para la deteccion de la Adventicia en imagenes de Ecografia Intracoronaria. REC - Revista Española de Cardiología, 100.
|
J. Pladellorens, Joan Serrat, A. Castell, & M.J. Yzuel. (1993). Using mathematical morphology to determine left ventricular contours. Physics in Medicine and Biology., 1877––1894.
|
M. Bressan, David Guillamet, & Jordi Vitria. (2003). Using an ICA Representation of Local Color Histograms for Object Recognition. Pattern Recognition, 36(3):691–701 (IF: 1.611).
|
Angel Sappa. (2006). Unsupervised Contour Closure Algorithm for Range Image Edge-Based Segmentation. IEEE Transactions on Image Processing, 15(2):377–384.
|
M. Gomez, J. Mauri, E. Fernandez-Nofrerias, Oriol Rodriguez-Leor, Carme Julia, David Rotger, et al. (2002). Una nova aplicacio informatica per a la correlacio d imatges angiografiques i d ecografia intracoronaria. Revista de la Societat Catalana de Cardiologia, 4(4): 42, XIV Congres de la Societat Catalana de Cardiologia..
|
B. Gotschy, Matthias S. Keil, H. Klos, & I. Rystau. (1994). Transition from static to dynamic Jahn-Teller distortion in (P(C6 H5)4)2 C60|. Solid State Communications, 92(12): 935–938.
|
Mikhail Mozerov, Ariel Amato, Xavier Roca, & Jordi Gonzalez. (2008). Trajectory Occlusion Handling with Multiple View Distance Minimisation Clustering. Optical Engineering, vol. 47(04)04702, DOI:10.11781.2909665.
|
Roger Max Calle Quispe, Maya Aghaei Gavari, & Eduardo Aguilar Torres. (2023). Towards real-time accurate safety helmets detection through a deep learning-based method. Ingeniare. Revista chilena de ingenieria.
Abstract: Occupational safety is a fundamental activity in industries and revolves around the management of the necessary controls that must be present to mitigate occupational risks. These controls include verifying the use of Personal Protection Equipment (PPE). Within PPE, safety helmets are vital to reducing severe or fatal consequences caused by head injuries. This problem has been addressed recently by various research based on deep learning to detect the usage of safety helmets by the present people in the industrial field.
These works have achieved promising results for safety helmet detection using object detection methods from the YOLO family. In this work, we propose to analyze the performance of Scaled-YOLOv4, a novel model of the YOLO family that has yet to be previously studied for this problem. The performance of the Scaled-YOLOv4 is evaluated on two public databases, carefully selected among the previously proposed datasets for the occupational safety framework. We demonstrate the superiority of Scaled-YOLOv4 in terms of mAP and Fl-score concerning the previous works for both databases. Further, we summarize the currently available datasets for safety helmet detection purposes and discuss their suitability.
|
Patrick Brandao, O. Zisimopoulos, E. Mazomenos, G. Ciutib, Jorge Bernal, M. Visentini-Scarzanell, et al. (2018). Towards a computed-aided diagnosis system in colonoscopy: Automatic polyp segmentation using convolution neural networks. JMRR - Journal of Medical Robotics Research.
Abstract: Early diagnosis is essential for the successful treatment of bowel cancers including colorectal cancer (CRC) and capsule endoscopic imaging with robotic actuation can be a valuable diagnostic tool when combined with automated image analysis. We present a deep learning rooted detection and segmentation framework for recognizing lesions in colonoscopy and capsule endoscopy images. We restructure established convolution architectures, such as VGG and ResNets, by converting them into fully-connected convolution networks (FCNs), ne-tune them and study their capabilities for polyp segmentation and detection. We additionally use Shape-from-Shading (SfS) to recover depth and provide a richer representation of the tissue's structure in colonoscopy images. Depth is
incorporated into our network models as an additional input channel to the RGB information and we demonstrate that the resulting network yields improved performance. Our networks are tested on publicly available datasets and the most accurate segmentation model achieved a mean segmentation IU of 47.78% and 56.95% on the ETIS-Larib and CVC-Colon datasets, respectively. For polyp
detection, the top performing models we propose surpass the current state of the art with detection recalls superior to 90% for all datasets tested. To our knowledge, we present the rst work to use FCNs for polyp segmentation in addition to proposing a novel combination of SfS and RGB that boosts performance.
Keywords: convolutional neural networks; colonoscopy; computer aided diagnosis
|
Javier Varona, Antoni Jaume-i-Capo, Jordi Gonzalez, & Francisco Jose Perales. (2008). Toward Natural Interaction through Visual Recognition of Body Gestures in Real-Time. Interacting with Computers, diu 10,1016/j.intcom.2008.10.001, available on line.
|
Yong Xu, Jing-Yu Yang, & Zhong Jin. (2003). Theory analysis on FSLDA and ULDA. Pattern Recognition, 36(12): 3031–3033 (IF: 1.611).
|
Fernando Vilariño, Dimosthenis Karatzas, & Alberto Valcarce. (2018). The Library Living Lab Barcelona: A participative approach to technology as an enabling factor for innovation in cultural spaces. Technology Innovation Management Review.
|
Juan Andrade, & A. Sanfeliu. (2005). The effects of partial observability when building fully correlated maps. IEEE Transactions on Robotics, 21(4):771–777 (IF: 1.486).
|
A. Richichi, O. Fors, M.T. Merino, Xavier Otazu, J. Nuñez, A. Prades, et al. (2006). The Calar Alto lunar occultation program: update and new results. Astronomy and Astrophysics (Section ’Stellar structure and evolution’), 445:1081–1088.
|