6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model

Bortolon, Matteo; Tsesmelis, Theodore; James, Stuart; Poiesi, Fabio; Del Bue, Alessio

doi:10.1007/978-3-031-72943-0_24

We propose 6DGS to estimate the camera pose of a target RGB image given a 3D Gaussian Splatting (3DGS) model representing the scene. 6DGS avoids the iterative process typical of analysis-by-synthesis methods (e. g.iNeRF) that also require an initialization of the camera pose in order to converge. Instead, our method estimates a 6DoF pose by inverting the 3DGS rendering process. Starting from the object surface, we define a radiant Ellicell that uniformly generates rays departing from each ellipsoid that parameterize the 3DGS model. Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn is used to obtain the best bindings between the target image pixels and the cast rays. These pixel-ray bindings are then ranked to select the best scoring bundle of rays, which their intersection provides the camera center and, in turn, the camera rotation. The proposed solution obviates the necessity of an “a priori” pose for initialization, and it solves 6DoF pose estimation in closed form, without the need for iterations. Moreover, compared to the existing Novel View Synthesis (NVS) baselines for pose estimation, 6DGS can improve the overall average rotational accuracy by and translation accuracy by on real scenes, despite not requiring any initialization pose. At the same time, our method operates near real-time, reaching 15 fps on consumer hardware.

6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model

Matteo, Bortolon^Software;James, Stuart^Supervision;Poiesi, Fabio^Supervision;Del Bue, Alessio^Supervision

2024-01-01

Abstract

We propose 6DGS to estimate the camera pose of a target RGB image given a 3D Gaussian Splatting (3DGS) model representing the scene. 6DGS avoids the iterative process typical of analysis-by-synthesis methods (e. g.iNeRF) that also require an initialization of the camera pose in order to converge. Instead, our method estimates a 6DoF pose by inverting the 3DGS rendering process. Starting from the object surface, we define a radiant Ellicell that uniformly generates rays departing from each ellipsoid that parameterize the 3DGS model. Each Ellicell ray is associated with the rendering parameters of each ellipsoid, which in turn is used to obtain the best bindings between the target image pixels and the cast rays. These pixel-ray bindings are then ranked to select the best scoring bundle of rays, which their intersection provides the camera center and, in turn, the camera rotation. The proposed solution obviates the necessity of an “a priori” pose for initialization, and it solves 6DoF pose estimation in closed form, without the need for iterations. Moreover, compared to the existing Novel View Synthesis (NVS) baselines for pose estimation, 6DGS can improve the overall average rotational accuracy by and translation accuracy by on real scenes, despite not requiring any initialization pose. At the same time, our method operates near real-time, reaching 15 fps on consumer hardware.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2024
			
	Codice ISBN
	
				9783031729423
9783031729430
			
	Appare nelle tipologie:
	
				4.1 Contributo in Atti di convegno

File in questo prodotto:

File	Dimensione	Formato
06914.pdf solo utenti autorizzati Tipologia: Documento in Post-print Licenza: Copyright dell'editore Dimensione 14.68 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	14.68 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11582/358027

Citazioni

ND

Nome	Dominio	Durata	Descrizione
s_.*	plu.mx	sessione	recupero grafico citazioni sociali da plumx
A_.*	core.ac.uk	7 giorni	recupero pubblicazioni consigliate per il pannello core-recommander
GS_.*	gstatic.com	richiesta http	visualizza grafico citazioni
CC_.*	creativecommons.org	richiesta http	visualizza licenza bitstream

6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model

Matteo, Bortolon^Software;James, Stuart^Supervision;Poiesi, Fabio^Supervision;Del Bue, Alessio^Supervision

Software

Writing – Review & Editing

Supervision

Supervision

Supervision

2024-01-01

Abstract

Scheda breve

Scheda completa

Scheda completa (DC)

Citazioni

social impact

IRIS Institutional Research Information System

6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model

Matteo, BortolonSoftware;Tsesmelis, TheodoreWriting – Review & Editing;James, StuartSupervision;Poiesi, FabioSupervision;Del Bue, AlessioSupervision

Software

Writing – Review & Editing

Supervision

Supervision

Supervision

2024-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Citazioni

social impact

Conferma cancellazione

Matteo, Bortolon^Software;James, Stuart^Supervision;Poiesi, Fabio^Supervision;Del Bue, Alessio^Supervision

Scheda breve

Scheda completa

Scheda completa (DC)