The ability to automatically recognize a wide variety of objects in complex 3D urban environments without relying on predefined categories or annotated training data is becoming increasingly important for end-users of large-scale geospatial 3D datasets. Given that objects in urban scenes noticeably vary across locations, users and applications, flexible annotation-free methods for 3D semantic segmentation are getting desirable. In this work, we present and compare two approaches for classifying aerial photogrammetric point clouds. The first employs conventional supervised 3D neural networks trained on annotated datasets and predefined object classes. The second adopts a training-free, open-vocabulary strategy that detects objects directly in images and subsequently projects and refines them within 3D space. Approaches are evaluated through quantitative metrics and qualitative analysis, providing insights into their respective capabilities and limitations over 3D urban areas.

Towards annotation-less semantic segmentation of aerial point clouds

Alami, Ashkan;Remondino, Fabio
2026-01-01

Abstract

The ability to automatically recognize a wide variety of objects in complex 3D urban environments without relying on predefined categories or annotated training data is becoming increasingly important for end-users of large-scale geospatial 3D datasets. Given that objects in urban scenes noticeably vary across locations, users and applications, flexible annotation-free methods for 3D semantic segmentation are getting desirable. In this work, we present and compare two approaches for classifying aerial photogrammetric point clouds. The first employs conventional supervised 3D neural networks trained on annotated datasets and predefined object classes. The second adopts a training-free, open-vocabulary strategy that detects objects directly in images and subsequently projects and refines them within 3D space. Approaches are evaluated through quantitative metrics and qualitative analysis, providing insights into their respective capabilities and limitations over 3D urban areas.
File in questo prodotto:
File Dimensione Formato  
isprs-archives-XLVIII-4-W18-2025-19-2026.pdf

accesso aperto

Licenza: Non specificato
Dimensione 14.7 MB
Formato Adobe PDF
14.7 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/366607
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact