High-resolution Satellite Image Time Series (SITS), combined with advanced deep learning approaches such as transformers, enable reliable change detection by effectively using large amounts of temporal data. While most existing change detection methods overlook temporal dynamics and focus primarily on spatial features, Transformers are well-suited to capture both spatial and temporal features. This paper presents a Transformer-based spatio-temporal architecture for multi-annual change detection that integrates two key components: an Adaptive Swin Transformer Encoder (ASTE) for hierarchical feature extraction and a Feature Difference Module (FDM) to highlight changes for generating change detection maps. ASTE incorporates a Cross Attention Fusion (CAF) module, which integrates convolutional operations for spatial feature extraction and Swin Transformer to extract spatio-temporal dependencies. FDM refines the difference features, enhancing feature interaction for optimization and accurate change detection. The method was tested in the Trentino region of Italy, focusing on deforestation caused by the Vaia storm.

Transformer-based Spatio-temporal Change Detection Network Using Satellite Image Time Series: A Case Study of Forest Disturbance in Trentino, Italy Following the Vaia Storm

K. Meshkini;F. Bovolo
2025-01-01

Abstract

High-resolution Satellite Image Time Series (SITS), combined with advanced deep learning approaches such as transformers, enable reliable change detection by effectively using large amounts of temporal data. While most existing change detection methods overlook temporal dynamics and focus primarily on spatial features, Transformers are well-suited to capture both spatial and temporal features. This paper presents a Transformer-based spatio-temporal architecture for multi-annual change detection that integrates two key components: an Adaptive Swin Transformer Encoder (ASTE) for hierarchical feature extraction and a Feature Difference Module (FDM) to highlight changes for generating change detection maps. ASTE incorporates a Cross Attention Fusion (CAF) module, which integrates convolutional operations for spatial feature extraction and Swin Transformer to extract spatio-temporal dependencies. FDM refines the difference features, enhancing feature interaction for optimization and accurate change detection. The method was tested in the Trentino region of Italy, focusing on deforestation caused by the Vaia storm.
File in questo prodotto:
File Dimensione Formato  
Transformer-Based_Spatio-Temporal_Change_Detection.pdf

solo utenti autorizzati

Tipologia: Documento in Post-print
Licenza: Copyright dell'editore
Dimensione 571.44 kB
Formato Adobe PDF
571.44 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/365389
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
social impact