Simulated multimodal deep facial diagnosis; Deep facial diagnosisSimulated multimodalFace depth estimationFacial phenotypesCondition-specific facesBilinear

Type of Publication

Journal Articles

Date:

10 /

2024

Status

Published

DOI:

10.1016/j.eswa.2024.123881

Simulated multimodal deep facial diagnosis

Featured in:

Expert Systems with Applications

Authors:

Bo Jin, Nuno Gonçalves, Leandro Cruz, Iurii Medvedev, Yuanyu Yu and Jiujiang Wang

Abstract

Facial phenotypes are extensively studied in medical and biological research, serving as critical markers that potentially indicate underlying genetic traits or medical conditions. With the recent advancements in big data, algorithms, and hardware, deep facial diagnosis, which employs deep learning techniques to systematically examine facial phenotypes and identify signs of certain diseases or medical conditions, has attracted significant attention and research, gradually emerging as a promising tool in precision medicine. Primarily limited by the scarcity of data for training facial diagnosis models, the accuracy of facial diagnosis for various conditions remains low up to now. In the past decade, RGB-D cameras, measuring depth information along with standard RGB capabilities, have proven superior in processing spatial details with more stability and accuracy. Motivated by the facts mentioned above, in this paper, we propose a Simulated Multimodal Framework, which effectively improves the computer-aided facial diagnosis performance of state-of-the-art models in experiments under different conditions. The underlying principle is to leverage the simulated depth by generative models to improve the performance of RGB image recognition. Furthermore, as a rapid and non-invasive tool for disease screening and detection, our proposal demonstrated an average accuracy improvement of over 20% compared to practicing physicians in the study.

Citation
Bo Jin, Nuno Gonçalves, Leandro Cruz, Iurii Medvedev, Yuanyu Yu and Jiujiang Wang. Simulated multimodal deep facial diagnosis. Expert Systems with Applications, Volume 252, Part A, 2024. DOI: 10.1016/j.eswa.2024.123881.

Related Content

Researcher Coordinator, VIS TEAM Leader
Post-Doc Researcher
Researcher
Post-Doc Researcher
No tagged content to show
No tagged content to show
No tagged content to show

RECENT PUBLICATIONS

StylePuncher: encoding a hidden QR code into images

Authors: Farhad Shadmand; Luiz Schirmer; Nuno Gonçalves
Featured in: 14th International Conference on Pattern Recognition Applications and Methods (ICPRAM'25)

RiemStega: Covariance-based loss for print-proof transmission of data in images

Authors: Aniana Cruz; Guilherme Schardong; Luiz Schirmer; João Marcos, Farhad Shadmand; Nuno Gonçalves
Featured in: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025

MorFacing: A Benchmark for Estimation Face Recognition Robustness to Face Morphing Attacks

Authors: Iurii Medvedev and Nuno Gonçalves
Featured in: IEEE International Joint Conference on Biometrics (IJCB 2024)

suggested news

Nuno Gonçalves debates AI impact at "Café com...
Nuno Gonçalves participates in the conference-debate "IA -...
Laser engraving of precious metal artifacts (UniqueMark® deterministic...

RECENT PROJECTS

FACING2 – Face Image Understanding
VISUAL-ID – Unique Visual Identities in Graphics, Images and Faces
UniqueMark

Institute of Systems and Robotics Department of Electrical and Computers Engineering University of Coimbra