Records |
Author |
Sounak Dey; Palaiahnakote Shivakumara; K.S. Raghunanda; Umapada Pal; Tong Lu; G. Hemantha Kumar; Chee Seng Chan |
Title |
Script independent approach for multi-oriented text detection in scene image |
Type |
Journal Article |
Year |
2017 |
Publication |
Neurocomputing |
Abbreviated Journal |
NEUCOM |
Volume |
242 |
Issue |
|
Pages |
96-112 |
Keywords |
|
Abstract |
Developing a text detection method which is invariant to scripts in natural scene images is a challeng- ing task due to different geometrical structures of various scripts. Besides, multi-oriented of text lines in natural scene images make the problem more challenging. This paper proposes to explore ring radius transform (RRT) for text detection in multi-oriented and multi-script environments. The method finds component regions based on convex hull to generate radius matrices using RRT. It is a fact that RRT pro- vides low radius values for the pixels that are near to edges, constant radius values for the pixels that represent stroke width, and high radius values that represent holes created in background and convex hull because of the regular structures of text components. We apply k -means clustering on the radius matrices to group such spatially coherent regions into individual clusters. Then the proposed method studies the radius values of such cluster components that are close to the centroid and far from the cen- troid to detect text components. Furthermore, we have developed a Bangla dataset (named as ISI-UM dataset) and propose a semi-automatic system for generating its ground truth for text detection of arbi- trary orientations, which can be used by the researchers for text detection and recognition in the future. The ground truth will be released to public. Experimental results on our ISI-UM data and other standard datasets, namely, ICDAR 2013 scene, SVT and MSRA data, show that the proposed method outperforms the existing methods in terms of multi-lingual and multi-oriented text detection ability. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; 600.121 |
Approved |
no |
Call Number |
Admin @ si @ DSR2017 |
Serial |
3260 |
Permanent link to this record |
|
|
|
Author |
Sergio Escalera; R. M. Martinez; Jordi Vitria; Petia Radeva; Maria Teresa Anguera |
Title |
Dominance Detection in Face-to-face Conversations |
Type |
Conference Article |
Year |
2009 |
Publication |
2nd IEEE Workshop on CVPR for Human communicative Behavior analysis |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
97–102 |
Keywords |
|
Abstract |
Dominance is referred to the level of influence a person has in a conversation. Dominance is an important research area in social psychology, but the problem of its automatic estimation is a very recent topic in the contexts of social and wearable computing. In this paper, we focus on dominance detection from visual cues. We estimate the correlation among observers by categorizing the dominant people in a set of face-to-face conversations. Different dominance indicators from gestural communication are defined, manually annotated, and compared to the observers opinion. Moreover, the considered indicators are automatically extracted from video sequences and learnt by using binary classifiers. Results from the three analysis shows a high correlation and allows the categorization of dominant people in public discussion video sequences. |
Address |
Miami, USA |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
2160-7508 |
ISBN |
978-1-4244-3994-2 |
Medium |
|
Area |
|
Expedition |
|
Conference |
CVPR |
Notes |
HuPBA; OR; MILAB;MV |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ EMV2009 |
Serial |
1227 |
Permanent link to this record |
|
|
|
Author |
Miguel Oliveira; Angel Sappa; V. Santos |
Title |
Color Correction using 3D Gaussian Mixture Models |
Type |
Conference Article |
Year |
2012 |
Publication |
9th International Conference on Image Analysis and Recognition |
Abbreviated Journal |
|
Volume |
7324 |
Issue |
I |
Pages |
97-106 |
Keywords |
|
Abstract |
The current paper proposes a novel color correction approach based on a probabilistic segmentation framework by using 3D Gaussian Mixture Models. Regions are used to compute local color correction functions, which are then combined to obtain the final corrected image. The proposed approach is evaluated using both a recently published metric and two large data sets composed of seventy images. The evaluation is performed by comparing our algorithm with eight well known color correction algorithms. Results show that the proposed approach is the highest scoring color correction method. Also, the proposed single step 3D color space probabilistic segmentation reduces processing time over similar approaches. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
LNCS |
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0302-9743 |
ISBN |
10.1007/978-3-642-31295-3_12 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICIAR |
Notes |
ADAS |
Approved |
no |
Call Number |
Admin @ si @ OSS2012a |
Serial |
2015 |
Permanent link to this record |
|
|
|
Author |
Miguel Reyes; Albert Clapes; Luis Felipe Mejia; Jose Ramirez; Juan R Revilla; Sergio Escalera |
Title |
Posture Analysis and Range of Movement Estimation using Depth Maps |
Type |
Conference Article |
Year |
2012 |
Publication |
21st International Conference on Pattern Recognition International Workshop on Depth Image Analysis |
Abbreviated Journal |
|
Volume |
7854 |
Issue |
|
Pages |
97-105 |
Keywords |
|
Abstract |
World Health Organization estimates that 80% of the world population is affected of back pain during his life. Current practices to analyze back problems are expensive, subjective, and invasive. In this work, we propose a novel tool for posture and range of movement estimation based on the analysis of 3D information from depth maps. Given a set of keypoints defined by the user, RGB and depth data are aligned, depth surface is reconstructed, keypoints are matching using a novel point-to-point fitting procedure, and accurate measurements about posture, spinal curvature, and range of movement are computed. The system shows high precision and reliable measurements, being useful for posture reeducation purposes to prevent musculoskeletal disorders, such as back pain, as well as tracking the posture evolution of patients in rehabilitation treatments. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0302-9743 |
ISBN |
978-3-642-40302-6 |
Medium |
|
Area |
|
Expedition |
|
Conference |
WDIA |
Notes |
HuPBA;MILAB |
Approved |
no |
Call Number |
Admin @ si @ RCM2012 |
Serial |
2121 |
Permanent link to this record |
|
|
|
Author |
Carles Sanchez; Oriol Ramos Terrades; Patricia Marquez; Enric Marti; J.Roncaries; Debora Gil |
Title |
Automatic evaluation of practices in Moodle for Self Learning in Engineering |
Type |
Journal |
Year |
2015 |
Publication |
Journal of Technology and Science Education |
Abbreviated Journal |
JOTSE |
Volume |
5 |
Issue |
2 |
Pages |
97-106 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
IAM; DAG; 600.075; 600.077 |
Approved |
no |
Call Number |
Admin @ si @ SRM2015 |
Serial |
2610 |
Permanent link to this record |
|
|
|
Author |
Lluis Gomez; Marçal Rusiñol; Dimosthenis Karatzas |
Title |
Cutting Sayre's Knot: Reading Scene Text without Segmentation. Application to Utility Meters |
Type |
Conference Article |
Year |
2018 |
Publication |
13th IAPR International Workshop on Document Analysis Systems |
Abbreviated Journal |
|
Volume |
|
Issue |
|
Pages |
97-102 |
Keywords |
Robust Reading; End-to-end Systems; CNN; Utility Meters |
Abstract |
In this paper we present a segmentation-free system for reading text in natural scenes. A CNN architecture is trained in an end-to-end manner, and is able to directly output readings without any explicit text localization step. In order to validate our proposal, we focus on the specific case of reading utility meters. We present our results in a large dataset of images acquired by different users and devices, so text appears in any location, with different sizes, fonts and lengths, and the images present several distortions such as
dirt, illumination highlights or blur. |
Address |
Viena; Austria; April 2018 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
DAS |
Notes |
DAG; 600.084; 600.121; 600.129 |
Approved |
no |
Call Number |
Admin @ si @ GRK2018 |
Serial |
3102 |
Permanent link to this record |
|
|
|
Author |
Ricard Borras; Agata Lapedriza; Laura Igual |
Title |
Depth Information in Human Gait Analysis: An Experimental Study on Gender Recognition |
Type |
Conference Article |
Year |
2012 |
Publication |
9th International Conference on Image Analysis and Recognition |
Abbreviated Journal |
|
Volume |
7325 |
Issue |
II |
Pages |
98-105 |
Keywords |
|
Abstract |
This work presents DGait, a new gait database acquired with a depth camera. This database contains videos from 53 subjects walking in different directions. The intent of this database is to provide a public set to explore whether the depth can be used as an additional information source for gait classification purposes. Each video is labelled according to subject, gender and age. Furthermore, for each subject and view point, we provide initial and final frames of an entire walk cycle. On the other hand, we perform gait-based gender classification experiments with DGait database, in order to illustrate the usefulness of depth information for this purpose. In our experiments, we extract 2D and 3D gait features based on shape descriptors, and compare the performance of these features for gender identification, using a Kernel SVM. The obtained results show that depth can be an information source of great relevance for gait classification problems. |
Address |
Aveiro, Portugal |
Corporate Author |
|
Thesis |
|
Publisher |
Springer Berlin Heidelberg |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0302-9743 |
ISBN |
978-3-642-31297-7 |
Medium |
|
Area |
|
Expedition |
|
Conference |
ICIAR |
Notes |
OR; MILAB;MV |
Approved |
no |
Call Number |
Admin @ si @ BLI2012 |
Serial |
2009 |
Permanent link to this record |
|
|
|
Author |
Katerine Diaz; Aura Hernandez-Sabate; Antonio Lopez |
Title |
A reduced feature set for driver head pose estimation |
Type |
Journal Article |
Year |
2016 |
Publication |
Applied Soft Computing |
Abbreviated Journal |
ASOC |
Volume |
45 |
Issue |
|
Pages |
98-107 |
Keywords |
Head pose estimation; driving performance evaluation; subspace based methods; linear regression |
Abstract |
Evaluation of driving performance is of utmost importance in order to reduce road accident rate. Since driving ability includes visual-spatial and operational attention, among others, head pose estimation of the driver is a crucial indicator of driving performance. This paper proposes a new automatic method for coarse and fine head's yaw angle estimation of the driver. We rely on a set of geometric features computed from just three representative facial keypoints, namely the center of the eyes and the nose tip. With these geometric features, our method combines two manifold embedding methods and a linear regression one. In addition, the method has a confidence mechanism to decide if the classification of a sample is not reliable. The approach has been tested using the CMU-PIE dataset and our own driver dataset. Despite the very few facial keypoints required, the results are comparable to the state-of-the-art techniques. The low computational cost of the method and its robustness makes feasible to integrate it in massive consume devices as a real time application. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ADAS; 600.085; 600.076; |
Approved |
no |
Call Number |
Admin @ si @ DHL2016 |
Serial |
2760 |
Permanent link to this record |
|
|
|
Author |
Sergio Escalera; Oriol Pujol; Petia Radeva |
Title |
Traffic sign recognition system with β -correction |
Type |
Journal Article |
Year |
2010 |
Publication |
Machine Vision and Applications |
Abbreviated Journal |
MVA |
Volume |
21 |
Issue |
2 |
Pages |
99–111 |
Keywords |
|
Abstract |
Traffic sign classification represents a classical application of multi-object recognition processing in uncontrolled adverse environments. Lack of visibility, illumination changes, and partial occlusions are just a few problems. In this paper, we introduce a novel system for multi-class classification of traffic signs based on error correcting output codes (ECOC). ECOC is based on an ensemble of binary classifiers that are trained on bi-partition of classes. We classify a wide set of traffic signs types using robust error correcting codings. Moreover, we introduce the novel β-correction decoding strategy that outperforms the state-of-the-art decoding techniques, classifying a high number of classes with great success. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
Springer-Verlag |
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0932-8092 |
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
MILAB;HUPBA |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ EPR2010a |
Serial |
1276 |
Permanent link to this record |
|
|
|
Author |
Jorge Bernal; F. Javier Sanchez; Gloria Fernandez Esparrach; Debora Gil; Cristina Rodriguez de Miguel; Fernando Vilariño |
Title |
WM-DOVA Maps for Accurate Polyp Highlighting in Colonoscopy: Validation vs. Saliency Maps from Physicians |
Type |
Journal Article |
Year |
2015 |
Publication |
Computerized Medical Imaging and Graphics |
Abbreviated Journal |
CMIG |
Volume |
43 |
Issue |
|
Pages |
99-111 |
Keywords |
Polyp localization; Energy Maps; Colonoscopy; Saliency; Valley detection |
Abstract |
We introduce in this paper a novel polyp localization method for colonoscopy videos. Our method is based on a model of appearance for polyps which defines polyp boundaries in terms of valley information. We propose the integration of valley information in a robust way fostering complete, concave and continuous boundaries typically associated to polyps. This integration is done by using a window of radial sectors which accumulate valley information to create WMDOVA1 energy maps related with the likelihood of polyp presence. We perform a double validation of our maps, which include the introduction of two new databases, including the first, up to our knowledge, fully annotated database with clinical metadata associated. First we assess that the highest value corresponds with the location of the polyp in the image. Second, we show that WM-DOVA energy maps can be comparable with saliency maps obtained from physicians' fixations obtained via an eye-tracker. Finally, we prove that our method outperforms state-of-the-art computational saliency results. Our method shows good performance, particularly for small polyps which are reported to be the main sources of polyp miss-rate, which indicates the potential applicability of our method in clinical practice. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
0895-6111 |
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
MV; IAM; 600.047; 600.060; 600.075;SIAI |
Approved |
no |
Call Number |
Admin @ si @ BSF2015 |
Serial |
2609 |
Permanent link to this record |
|
|
|
Author |
Akhil Gurram; Onay Urfalioglu; Ibrahim Halfaoui; Fahd Bouzaraa; Antonio Lopez |
Title |
Semantic Monocular Depth Estimation Based on Artificial Intelligence |
Type |
Journal Article |
Year |
2020 |
Publication |
IEEE Intelligent Transportation Systems Magazine |
Abbreviated Journal |
ITSM |
Volume |
13 |
Issue |
4 |
Pages |
99-103 |
Keywords |
|
Abstract |
Depth estimation provides essential information to perform autonomous driving and driver assistance. A promising line of work consists of introducing additional semantic information about the traffic scene when training CNNs for depth estimation. In practice, this means that the depth data used for CNN training is complemented with images having pixel-wise semantic labels where the same raw training data is associated with both types of ground truth, i.e., depth and semantic labels. The main contribution of this paper is to show that this hard constraint can be circumvented, i.e., that we can train CNNs for depth estimation by leveraging the depth and semantic information coming from heterogeneous datasets. In order to illustrate the benefits of our approach, we combine KITTI depth and Cityscapes semantic segmentation datasets, outperforming state-of-the-art results on monocular depth estimation. |
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
ADAS; 600.124; 600.118 |
Approved |
no |
Call Number |
Admin @ si @ GUH2019 |
Serial |
3306 |
Permanent link to this record |
|
|
|
Author |
David Berga; Xavier Otazu; Xose R. Fernandez-Vidal; Victor Leboran; Xose M. Pardo |
Title |
Generating Synthetic Images for Visual Attention Modeling |
Type |
Journal Article |
Year |
2019 |
Publication |
Perception |
Abbreviated Journal |
PER |
Volume |
48 |
Issue |
|
Pages |
99 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
NEUROBIT; no menciona |
Approved |
no |
Call Number |
Admin @ si @ BOF2019 |
Serial |
3309 |
Permanent link to this record |
|
|
|
Author |
Joana Maria Pujadas-Mora; Alicia Fornes; Oriol Ramos Terrades; Josep Llados; Jialuo Chen; Miquel Valls-Figols; Anna Cabre |
Title |
The Barcelona Historical Marriage Database and the Baix Llobregat Demographic Database. From Algorithms for Handwriting Recognition to Individual-Level Demographic and Socioeconomic Data |
Type |
Journal |
Year |
2022 |
Publication |
Historical Life Course Studies |
Abbreviated Journal |
HLCS |
Volume |
12 |
Issue |
|
Pages |
99-132 |
Keywords |
Individual demographic databases; Computer vision, Record linkage; Social mobility; Inequality; Migration; Word spotting; Handwriting recognition; Local censuses; Marriage Licences |
Abstract |
The Barcelona Historical Marriage Database (BHMD) gathers records of the more than 600,000 marriages celebrated in the Diocese of Barcelona and their taxation registered in Barcelona Cathedral's so-called Marriage Licenses Books for the long period 1451–1905 and the BALL Demographic Database brings together the individual information recorded in the population registers, censuses and fiscal censuses of the main municipalities of the county of Baix Llobregat (Barcelona). In this ongoing collection 263,786 individual observations have been assembled, dating from the period between 1828 and 1965 by December 2020. The two databases started as part of different interdisciplinary research projects at the crossroads of Historical Demography and Computer Vision. Their construction uses artificial intelligence and computer vision methods as Handwriting Recognition to reduce the time of execution. However, its current state still requires some human intervention which explains the implemented crowdsourcing and game sourcing experiences. Moreover, knowledge graph techniques have allowed the application of advanced record linkage to link the same individuals and families across time and space. Moreover, we will discuss the main research lines using both databases developed so far in historical demography. |
Address |
June 23, 2022 |
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
|
Notes |
DAG; 600.121; 600.162; 602.230; 600.140 |
Approved |
no |
Call Number |
Admin @ si @ PFR2022 |
Serial |
3737 |
Permanent link to this record |
|
|
|
Author |
Oriol Rodriguez-Leon; Josefina Mauri;Eduard Fernandez-Nofrerias; Antonio Tovar; Vicente del Valle; Aura Hernandez-Sabate; Debora Gil; Petia Radeva |
Title |
Utilizacion de la estructura de los campos vectoriales para la deteccion de la Adventicia en imagenes de Ecografia Intracoronaria |
Type |
Journal |
Year |
2004 |
Publication |
Revista Española de Cardiología |
Abbreviated Journal |
REC |
Volume |
57 |
Issue |
2 |
Pages |
100 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
SEC |
Notes |
MILAB;IAM |
Approved |
no |
Call Number |
BCNPCL @ bcnpcl @ RMF2004 |
Serial |
566 |
Permanent link to this record |
|
|
|
Author |
Oriol Rodriguez-Leon; Josefina Mauri;Eduard Fernandez-Nofrerias; Antonio Tovar; Vicente del Valle; Aura Hernandez-Sabate; Debora Gil; Petia Radeva |
Title |
Utilización de la Estructura de los Campos Vectoriales para la Detección de la Adventicia en Imágenes de Ecografía Intracoronaria |
Type |
Journal Article |
Year |
2004 |
Publication |
Revista Internacional de Enfermedades Cardiovasculares Revista Española de Cardiología |
Abbreviated Journal |
|
Volume |
57 |
Issue |
2 |
Pages |
100 |
Keywords |
|
Abstract |
|
Address |
|
Corporate Author |
|
Thesis |
|
Publisher |
|
Place of Publication |
|
Editor |
|
Language |
|
Summary Language |
|
Original Title |
|
Series Editor |
|
Series Title |
|
Abbreviated Series Title |
|
Series Volume |
|
Series Issue |
|
Edition |
|
ISSN |
|
ISBN |
|
Medium |
|
Area |
|
Expedition |
|
Conference |
SEC |
Notes |
IAM;MILAB |
Approved |
no |
Call Number |
IAM @ iam @ RMF2004 |
Serial |
1642 |
Permanent link to this record |