Type of Publication

Thesis

Date:

9 /

2022

Status

Published

Reducing Overconfident Predictions in Multimodality Perception for Autonomous Driving

Featured in:

PhD Thesis

Authors:

Gledson Melotti

Abstract

In the last recent years, machine learning techniques have occupied a great space in order to solve problems in the areas related to perception systems applied to autonomous driving and advanced driver-assistance systems, such as: road users detection, traffic signal recognition, road detection, multiple object tracking, lane detection, scene understanding. In this way, a large number of techniques have been developed to cope with problems belonging to sensory perception field. Currently, deep network is the state-of-the-art for object recognition, begin softmax and sigmoid functions as prediction layers. Such layers often produce overconfident predictions rather than proper probabilistic scores, which can thus harm the decision-making of “critical” perception systems applied in autonomous driving and robotics. Given this, we propose a probabilistic approach based on distributions calculated out of the logit layer scores
of pre-trained networks which are then used to constitute new decision layers based on Maximum Likelihood (ML) and Maximum a-Posteriori (MAP) inference. We demonstrate that the hereafter called ML and MAP functions are more suitable for probabilistic interpretations than softmax and sigmoid-based predictions for object recognition, where our approach shows promising performance compared to the usual softmax and sigmoid functions, with the benefit of enabling interpretable probabilistic predictions. Another advantage of the approach introduced in this thesis is that the so-called ML and MAP functions can be implemented in existing trained networks, that is, the approach benefits from the output of the logit layer of pre-trained networks. Thus, there is no need to carry out a new training phase since the ML and MAP functions are used in the test/prediction phase. To validate our methodology, we explored distinct sensor modalities via RGB images and LiDARs (3D point clouds, range-view and reflectance-view) data from the KITTI dataset. The range-view and reflectance-view modalities were obtained by projecting the range/reflectance data to the 2D image-plane and consequently upsampling the projected points. The results achieved by the proposed approach were presented considering the individual modalities and through the early and late fusion strategies.

Citation

Gledson Melotti (2022). Reducing Overconfident Predictions in Multimodality Perception for Autonomous Driving. PhD Thesis. University of Coimbra, 2022

RECENT PUBLICATIONS

FLOWING: Implicit Neural Flows for Structure-Preserving Morphing

Authors: Arthur Bizzi; Matias Grynberg; Vitor Matias; Daniel Perazzo; João Paulo Lima; Luiz Velho; Nuno Gonçalves; João Pereira; Guilherme Schardong; Tiago Novello

Featured in: 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

Adversarial Attack Challenge for Secure Face Recognition 2025

Authors: João Tremoço, Iurii Medvedev, Nuno Freitas, Andreia Costa, Diogo Nunes, Niklas Bunzel, Lukas Graner, Nicholas Göller, Lorenzo Pellegrini, Nicolò Di Domenico, Guido Borghi, Monson Verghese, Shruti Bhilare, Avik Hati, Miguel Lourenço, Nuno Gonçalves

Featured in: IEEE International Joint Conference on Biometrics (IJCB 2025)

VOIDFace: A Privacy-Preserving Multi-Network Face Recognition With Enhanced Security

Authors: Ajnas Muhammed; Iurii Medvedev; Nuno Gonçalves

Featured in: IEEE International Joint Conference on Biometrics (IJCB 2025)

suggested news

Paper accepted to IJCB 2025

Prof. Nuno and VIS Team successfully organizes IbPRIA...

Four papers presented @ IbPRIA 2025

RECENT PROJECTS

FACING2 – Face Image Understanding

VISUAL-ID – Unique Visual Identities in Graphics, Images and Faces

UniqueMark

Publication featured in: PhD Thesis

Resource featured in: PhD Thesis

Reducing Overconfident Predictions in Multimodality Perception for Autonomous Driving

Abstract

Citation

Related Content

RECENT PUBLICATIONS

FLOWING: Implicit Neural Flows for Structure-Preserving Morphing

Authors: Arthur Bizzi; Matias Grynberg; Vitor Matias; Daniel Perazzo; João Paulo Lima; Luiz Velho; Nuno Gonçalves; João Pereira; Guilherme Schardong; Tiago Novello

Featured in: 39th Conference on Neural Information Processing Systems (NeurIPS 2025)

Adversarial Attack Challenge for Secure Face Recognition 2025

Authors: João Tremoço, Iurii Medvedev, Nuno Freitas, Andreia Costa, Diogo Nunes, Niklas Bunzel, Lukas Graner, Nicholas Göller, Lorenzo Pellegrini, Nicolò Di Domenico, Guido Borghi, Monson Verghese, Shruti Bhilare, Avik Hati, Miguel Lourenço, Nuno Gonçalves

Featured in: IEEE International Joint Conference on Biometrics (IJCB 2025)

VOIDFace: A Privacy-Preserving Multi-Network Face Recognition With Enhanced Security

Authors: Ajnas Muhammed; Iurii Medvedev; Nuno Gonçalves

Featured in: IEEE International Joint Conference on Biometrics (IJCB 2025)

suggested news

RECENT PROJECTS

The Lab

About us

Resources