Simultaneous Localization and Mapping (SLAM) has become a fundamental technology in various applications, including robotics, autonomous navigation, geographic information systems (GIS), and infrastructure inspection. This paper presents a new version of GuPho, a low-cost, lightweight, and portable visual SLAM-based system equipped with AI-driven capabilities for real-time mapping, object detection, and defect analysis. The system integrates stereo vision and deep learning (DL) methods to enhance spatial understanding and enable accurate real-time scene interpretation. In particular, we explore DL-based semantic segmentation, monocular depth estimation (MDE), and stereo depth estimation to improve 3D reconstruction and size measurement of cracks for infrastructure monitoring. We implement state-of-the-art neural networks, including RF-DETR and YOLO for real-time crack and windows segmentation and Depth Anything V2, Depth Pro, and Unimatch for depth estimation. Our results demonstrate the potential of GuPho as an affordable and efficient system for real-time mobile mapping and defect assessment. The real-time and AI capabilities of our in-house solution are showcased here: https://youtu.be/ATIwn4zOSFw

AI-Based 3D Vision System for Inspection and Monitoring

Padkan, Nazanin
Methodology
;
Facenda, Samuele
Software
;
Morelli, Luca
Membro del Collaboration Group
;
Elalailyi, Ahmad
Membro del Collaboration Group
;
Remondino, Fabio
Supervision
2025-01-01

Abstract

Simultaneous Localization and Mapping (SLAM) has become a fundamental technology in various applications, including robotics, autonomous navigation, geographic information systems (GIS), and infrastructure inspection. This paper presents a new version of GuPho, a low-cost, lightweight, and portable visual SLAM-based system equipped with AI-driven capabilities for real-time mapping, object detection, and defect analysis. The system integrates stereo vision and deep learning (DL) methods to enhance spatial understanding and enable accurate real-time scene interpretation. In particular, we explore DL-based semantic segmentation, monocular depth estimation (MDE), and stereo depth estimation to improve 3D reconstruction and size measurement of cracks for infrastructure monitoring. We implement state-of-the-art neural networks, including RF-DETR and YOLO for real-time crack and windows segmentation and Depth Anything V2, Depth Pro, and Unimatch for depth estimation. Our results demonstrate the potential of GuPho as an affordable and efficient system for real-time mobile mapping and defect assessment. The real-time and AI capabilities of our in-house solution are showcased here: https://youtu.be/ATIwn4zOSFw
File in questo prodotto:
File Dimensione Formato  
isprs-archives-XLVIII-1-W5-2025-101-2025.pdf

accesso aperto

Tipologia: Documento in Post-print
Licenza: PUBBLICO - Creative Commons 2.2
Dimensione 1.47 MB
Formato Adobe PDF
1.47 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/364527
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact