Type of Publication

Conference Papers

Date:

4 /

2020

Status

Published

DOI:

10.1109/ICARSC49921.2020.9096138

Multimodal Deep-Learning for Object Recognition Combining Camera and LIDAR Data

Featured in:

2020 IEEE International Conference on Autonomous Robot Systems and Competitions, Ponta Delgada, Portugal

Authors:

Gledson Melotti, Cristiano Premebida and Nuno Gonçalves

Abstract

Object detection and recognition is a key component of autonomous robotic vehicles, as evidenced by the continuous efforts made by the robotic community on areas related to object detection and sensory perception systems. This paper presents a study on multisensor (camera and LIDAR) late fusion strategies for object recognition. In this work, LIDAR data is processed as 3D points and also by means of a 2D representation in the form of depth map (DM), which is obtained by projecting the LIDAR 3D point cloud into a 2D image plane followed by an upsampling strategy which generates a high-resolution 2D range view. A CNN network (Inception V3) is used as classification method on the RGB images, and on the DMs (LIDAR modality). A 3D-network (the PointNet), which directly performs classification on the 3D point clouds, is also considered in the experiments. One of the motivations of this work consists of incorporating the distance to the objects, as measured by he LIDAR, as a relevant cue to improve the classification performance. A new rangebased average weighting strategy is proposed, which considers the relationship between the deep-models performance and the distance of objects. A classification dataset, based on the KITTI database, is used to evaluate the deep-models, and to support the experimental part. We report extensive results in terms of single modality i.e., using RGB and LIDAR models individually, and late fusion multimodality approaches.

Citation
Gledson Melotti, Cristiano Premebida and Nuno Gonçalves. Multimodal deep-learning for object recognition combining camera and LIDAR data. In 2020 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) (pp. 177-182). IEEE. DOI: 10.1109/ICARSC49921.2020.9096138

Related Content

Researcher Coordinator, VIS TEAM Leader
PhD Student
No tagged content to show
No tagged content to show
No tagged content to show

RECENT PUBLICATIONS

Using Benford’s Law for Deepfake Detection

Authors: Miguel Leão; Nuno Gonçalves
Featured in: RECPAD - 30th Portuguese Conference on Pattern Recognition. 2024, Covilhã, Portugal

Proceedings of the 12th Iberian Conference on Pattern Recognition and Image Analysis Part I

Authors: Nuno Gonçalves; Hélder P. Oliveira; Joan Andreu Sánchez
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

Proceedings of the 12th Iberian Conference on Pattern Recognition and Image Analysis Part II

Authors: Nuno Gonçalves; Hélder P. Oliveira; Joan Andreu Sánchez
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

suggested news

Paper accepted to IJCB 2025
Prof. Nuno and VIS Team successfully organizes IbPRIA...
Four papers presented @ IbPRIA 2025

RECENT PROJECTS

FACING2 – Face Image Understanding
VISUAL-ID – Unique Visual Identities in Graphics, Images and Faces
UniqueMark

Institute of Systems and Robotics Department of Electrical and Computers Engineering University of Coimbra