Type of Publication

Conference Papers

Date:

4/2020

Status

Published

DOI:

10.1109/ICARSC49921.2020.9096138

Multimodal Deep-Learning for Object Recognition Combining Camera and LIDAR Data

Featured in:

2020 IEEE International Conference on Autonomous Robot Systems and Competitions, Ponta Delgada, Portugal

Authors:

Gledson Melotti, Cristiano Premebida and Nuno Gonçalves

Abstract

Object detection and recognition is a key component of autonomous robotic vehicles, as evidenced by the continuous efforts made by the robotics community in areas related to object detection and sensory perception systems. This paper presents a study on multisensor (camera and LIDAR) late fusion strategies for object recognition. In this work, LIDAR data is processed as 3D points and also by means of a 2D representation in the form of a depth map (DM), which is obtained by projecting the LIDAR 3D point cloud onto a 2D image plane, followed by an upsampling strategy that generates a high-resolution 2D range view. A CNN (Inception V3) is used as the classification method on the RGB images and on the DMs (LIDAR modality). A 3D network (PointNet), which performs classification directly on the 3D point clouds, is also considered in the experiments. One of the motivations of this work is to incorporate the distance to the objects, as measured by the LIDAR, as a relevant cue to improve classification performance. A new range-based average weighting strategy is proposed, which considers the relationship between the performance of the deep models and the distance of the objects. A classification dataset, based on the KITTI database, is used to evaluate the deep models and to support the experimental part. We report extensive results for single-modality approaches, i.e., using the RGB and LIDAR models individually, and for late fusion multimodal approaches.
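As a rough illustration of what a range-dependent late fusion of two classifiers might look like, the sketch below averages the class probabilities of an RGB model and a LIDAR model with a weight chosen from the object's distance. This is not the authors' method: the abstract does not give the weighting formula, so the function name `range_weighted_fusion`, the range bins, and the per-bin weights are all hypothetical placeholders chosen for the example.

```python
import numpy as np

def range_weighted_fusion(p_rgb, p_lidar, distance_m, bin_edges, w_rgb_per_bin):
    """Fuse two class-probability vectors with a range-dependent weight.

    Hypothetical sketch: w_rgb_per_bin holds one RGB weight per range bin
    (len(bin_edges) + 1 entries); the LIDAR model gets the remaining weight.
    """
    p_rgb = np.asarray(p_rgb, dtype=float)
    p_lidar = np.asarray(p_lidar, dtype=float)
    # Index of the range bin that contains this object's distance.
    bin_idx = int(np.digitize(distance_m, bin_edges))
    w_rgb = w_rgb_per_bin[bin_idx]
    # Weighted average of the two models' scores, renormalised to sum to 1.
    fused = w_rgb * p_rgb + (1.0 - w_rgb) * p_lidar
    return fused / fused.sum()

# Assumed values for illustration only (not taken from the paper).
bin_edges = [10.0, 30.0]          # metres: near, mid, far
w_rgb_per_bin = [0.3, 0.5, 0.8]   # trust RGB more as range increases

p_rgb = [0.7, 0.2, 0.1]    # RGB model scores for three classes
p_lidar = [0.2, 0.6, 0.2]  # LIDAR model scores for the same classes

near = range_weighted_fusion(p_rgb, p_lidar, 5.0, bin_edges, w_rgb_per_bin)
far = range_weighted_fusion(p_rgb, p_lidar, 40.0, bin_edges, w_rgb_per_bin)
print(near.argmax(), far.argmax())
```

In practice the per-bin weights would presumably be derived from each model's measured performance at each range on a validation set, which is the relationship the abstract refers to; the constants above only demonstrate that the fused decision can change with distance.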

Citation
Gledson Melotti, Cristiano Premebida and Nuno Gonçalves. Multimodal deep-learning for object recognition combining camera and LIDAR data. In 2020 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) (pp. 177-182). IEEE. DOI: 10.1109/ICARSC49921.2020.9096138

Institute of Systems and Robotics, Department of Electrical and Computers Engineering, University of Coimbra