Type of Publication

Thesis

Date:

12 /

2023

Status

Published

Pseudo RGB-D Facial Image Processing – Towards Face Recognition And Facial Diagnosis

Featured in:

PhD Thesis

Authors:

Jin Bo

Abstract

Today, face image-based applications have become widespread in fields such as security, medicine, and entertainment. Factors like lighting, pose, and facial expressions can impact the performance of these applications. Over the past decade, the development and affordability of low-cost RGB-D sensors have made it possible to obtain depth information of objects, leading researchers to tackle face recognition problems by capturing RGB-D face images. However, due to privacy restrictions, acquiring depth data from human faces remains challenging, and 2D RGB face images are still prevalent. Intelligent beings, such as humans, can use their vast experience to derive 3D spatial information from 2D scenes. Machine learning methodologies aim to solve such problems by training computers to generate accurate answers. Our research’s objective is to enhance the performance of subsequent face processing tasks, such as face recognition and facial diagnosis, by obtaining depth maps directly from corresponding RGB images. We propose a pseudo RGB-D facial image processing framework that replaces depth sensors with generated pseudo-depth maps and others data-driven methods to create depth maps from 2D face images. Specifically, we design and implement a generative adversarial network model named ‘D+GAN’ for multi-conditional image-to-image translation with facial attributes. We validate the pseudo RGB-D facial image processing approach through experiments on face recognition and facial diagnosis using various datasets. The pseudo RGB-D facial image processing framework works in conjunction with image fusion algorithms to enhance face recognition and facial diagnosis performance. To further exploit pseudo-depth features, we ultimately propose a simulated multimodal facial image processing framework that significantly improves performance with a higher probability.

Citation
Jin Bo (2023). Pseudo RGB-D Facial Image Processing – Towards Face Recognition And Facial Diagnosis. PhD Thesis. University of Coimbra, 2023.

Related Content

Researcher Coordinator, VIS TEAM Leader
Post-Doc Researcher
No tagged content to show
No tagged content to show
No tagged content to show

RECENT PUBLICATIONS

Using Benford’s Law for Deepfake Detection

Authors: Miguel Leão; Nuno Gonçalves
Featured in: RECPAD - 30th Portuguese Conference on Pattern Recognition. 2024, Covilhã, Portugal

Proceedings of the 12th Iberian Conference on Pattern Recognition and Image Analysis Part I

Authors: Nuno Gonçalves; Hélder P. Oliveira; Joan Andreu Sánchez
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

Proceedings of the 12th Iberian Conference on Pattern Recognition and Image Analysis Part II

Authors: Nuno Gonçalves; Hélder P. Oliveira; Joan Andreu Sánchez
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

suggested news

Paper accepted to IJCB 2025
Prof. Nuno and VIS Team successfully organizes IbPRIA...
Four papers presented @ IbPRIA 2025

RECENT PROJECTS

FACING2 – Face Image Understanding
VISUAL-ID – Unique Visual Identities in Graphics, Images and Faces
UniqueMark

Institute of Systems and Robotics Department of Electrical and Computers Engineering University of Coimbra