Featured in:
Applied Sciences
Authors:
Gustavo Assunção, Nuno Gonçalves and Paulo Menezes
This paper introduces a neuroscience-inspired methodology for fusing auditory and visual data to improve active speaker detection. Validation on public datasets gave results that exceeded expectations, pointing to applications in areas such as teleconferencing and social robotics.
Institute of Systems and Robotics, Department of Electrical and Computer Engineering, University of Coimbra