|
Lluis Pere de las Heras, Ahmed Sheraz, Marcus Liwicki, Ernest Valveny and Gemma Sanchez. 2014. Statistical Segmentation and Structural Recognition for Floor Plan Interpretation. IJDAR, 17(3), 221–237.
Abstract: A generic method for floor plan analysis and interpretation is presented in this article. The method, which is mainly inspired by the way engineers draw and interpret floor plans, applies two recognition steps in a bottom-up manner. First, basic building blocks, i.e., walls, doors, and windows are detected using a statistical patch-based segmentation approach. Second, a graph is generated, and structural pattern recognition techniques are applied to further locate the main entities, i.e., rooms of the building. The proposed approach is able to analyze any type of floor plan regardless of the notation used. We have evaluated our method on different publicly available datasets of real architectural floor plans with different notations. The overall detection and recognition accuracy is about 95%, which is significantly better than any other state-of-the-art method. Our approach is generic enough that it could be easily adapted to the recognition and interpretation of any other printed machine-generated structured documents.
|
|
|
Yi Xiao, Felipe Codevilla, Akhil Gurram, Onay Urfalioglu and Antonio Lopez. 2020. Multimodal end-to-end autonomous driving. TITS, 1–11.
Abstract: A crucial component of an autonomous vehicle (AV) is the artificial intelligence (AI) that is able to drive towards a desired destination. Today, there are different paradigms addressing the development of AI drivers. On the one hand, we find modular pipelines, which divide the driving task into sub-tasks such as perception and maneuver planning and control. On the other hand, we find end-to-end driving approaches that try to learn a direct mapping from input raw sensor data to vehicle control signals. The latter are relatively less studied, but are gaining popularity since they are less demanding in terms of sensor data annotation. This paper focuses on end-to-end autonomous driving. So far, most proposals relying on this paradigm assume RGB images as input sensor data. However, AVs will not be equipped only with cameras, but also with active sensors providing accurate depth information (e.g., LiDARs). Accordingly, this paper analyses whether combining RGB and depth modalities, i.e. using RGBD data, produces better end-to-end AI drivers than relying on a single modality. We consider multimodality based on early, mid and late fusion schemes, both in multisensory and single-sensor (monocular depth estimation) settings. Using the CARLA simulator and conditional imitation learning (CIL), we show how, indeed, early fusion multimodality outperforms single-modality.
|
|
|
Felipe Lumbreras and Joan Serrat. 1996. Segmentation of petrographical images of marbles. Computers and Geosciences. 22(5):547–558.
|
|
|
J. Pladellorens, Joan Serrat, A. Castell and M.J. Yzuel. 1993. Using mathematical morphology to determine left ventricular contours.
|
|
|
J. Pladellorens, M.J. Yzuel, J. Castell and Joan Serrat. 1993. Calculo automatico del volumen del ventriculo izquierdo. Comparacion con expertos.
|
|
|
A.F. Sole, S. Ngan, G. Sapiro, X. Hu and Antonio Lopez. 2001. Anisotropic 2-D and 3-D Averaging of fMRI Signals. IEEE Transactions on Medical Imaging, 20(2): 86–93 (IF: 3.142).
|
|
|
Daniel Ponsa, Robert Benavente, Felipe Lumbreras, J. Martinez and Xavier Roca. 2003. Quality control of safety belts by machine vision inspection for real-time production.
|
|
|
A.F. Sole, Antonio Lopez and G. Sapiro. 2001. Crease Enhancement Diffusion. Computer Vision and Image Understanding, 84(2): 241–248 (IF: 1.298).
|
|
|
A. Restrepo, Angel Sappa and M. Devy. 2005. Edge registration versus triangular mesh registration, a comparative study.
|
|
|
Jaume Amores and Petia Radeva. 2005. Retrieval of IVUS Images Using Contextual Information and Elastic Matching.
|
|