Type of Publication

Conference Papers

Date:

4 /

2020

Status

Published

DOI:

10.1109/ICARSC49921.2020.9096068

Deep-Learning based Global and Semantic Feature Fusion for Indoor Scene Classification

Featured in:

2020 IEEE International Conference on Autonomous Robot Systems and Competitions, Ponta Delgada, Portugal

Authors:

Ricardo Pereira, Nuno Gonçalves, Luís Garrote, Tiago Barros, Ana Lopes and Urbano J. Nunes

Abstract

This paper focuses on the task of RGB indoor scene classification. A single scene may contain various configurations and points of view, but there are a small number of objects that can characterize the scene. In this paper we propose a deeplearning based Global and Semantic Feature Fusion Approach (GSF2App) with two branches. In the first branch (top branch), a CNN model is trained to extract global features from RGB images, taking leverage from the ImageNet pre-trained model to initialize our CNN’s weights. In the second branch (bottom branch), we develop a semantic feature vector that represents the objects in the image, which are detected and classified through the COCO dataset pre-trained YOLOv3 model. Then, both global and semantic features are combined in an intermediate feature fusion stage. The proposed approach was evaluated on the SUN RGB-D Dataset and NYU Depth Dataset V2 achieving state-of-the-art results on both datasets.

Citation
Ricardo Pereira, Nuno Gonçalves, Luís Garrote, Tiago Barros, Ana Lopes and Urbano J. Nunes (2020, April). Deep-learning based global and semantic feature fusion for indoor scene classification. In 2020 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC) (pp. 67-73). IEEE. DOI: 10.1109/ICARSC49921.2020.9096068

Related Content

Researcher Coordinator, VIS TEAM Leader
No tagged content to show
No tagged content to show
No tagged content to show

RECENT PUBLICATIONS

How to Rethink Education with Artificial Intelligence: The Portuguese Use Case from Political and Practical Perspectives

Authors: Nuno Gonçalves; Maria Helena Monteiro
Featured in: Futures'2025 Conference Artificial Intelligence in Education

Book of Extended Abstracts of the 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

Authors: Aitana Menárguez-Box; Angel Navarro ; Antonio Requena Jiménez; Brenda Nogueira; Dylan Perdigão; Farhad Shadmand; Iurii Medvedev; Marco Alexandre Tomás Tereso; Maria del Mar Coch-Alcina; Matheus Kovaleski; Miguel Leão; Teresa M.C. Pereira; Tomás Silva Santos Rocha
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

Pseudo-MOS Learning: A Full-to-No-Reference FIQA Framework

Authors: André Neto; Nuno Gonçalves
Featured in: 12th Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA 2025)

suggested news

Prof. Nuno participates in Conference on Digital Governance
ISR-UC maintains the “Excellent” rating in FCT evaluation!
Nuno Gonçalves presents seminar at the University of...

RECENT PROJECTS

FACING2 – Face Image Understanding
VISUAL-ID – Unique Visual Identities in Graphics, Images and Faces
UniqueMark

Institute of Systems and Robotics Department of Electrical and Computers Engineering University of Coimbra