Multi-source Domain Adaptation for Panoramic Semantic Segmentation

Dokumentdetails

ID

oai:arXiv.org:2408.16469

Thema

Computer Science - Computer Vision...

Autor

Jiang, Jing Zhao, Sicheng Zhu, Jiankun Tang, Wenbo Xu, Zhaopan Yang, Jidong Liu, Guoping Xing, Tengfei Xu, Pengfei Yao, Hongxun

Kategorie

Computer Science

Jahr

2024

Auflistungsdatum

15.01.2025

Schlüsselwörter

distortion deformation synthetic source adaptation domain panoramic images pinhole real domains segmentation

Metrisch

Zusammenfassung

Unsupervised domain adaptation methods for panoramic semantic segmentation utilize real pinhole images or low-cost synthetic panoramic images to transfer segmentation models to real panoramic images.

However, these methods struggle to understand the panoramic structure using only real pinhole images and lack real-world scene perception with only synthetic panoramic images.

Therefore, in this paper, we propose a new task, Multi-source Domain Adaptation for Panoramic Semantic Segmentation (MSDA4PASS), which leverages both real pinhole and synthetic panoramic images to improve segmentation on unlabeled real panoramic images.

There are two key issues in the MSDA4PASS task: (1) distortion gaps between the pinhole and panoramic domains -- panoramic images exhibit global and local distortions absent in pinhole images; (2) texture gaps between the source and target domains -- scenes and styles differ across domains.

To address these two issues, we propose a novel framework, Deformation Transform Aligner for Panoramic Semantic Segmentation (DTA4PASS), which converts all pinhole images in the source domains into distorted images and aligns the source distorted and panoramic images with the target panoramic images.

Specifically, DTA4PASS consists of two main components: Unpaired Semantic Morphing (USM) and Distortion Gating Alignment (DGA).

First, in USM, the Dual-view Discriminator (DvD) assists in training the diffeomorphic deformation network at the image and pixel level, enabling the effective deformation transformation of pinhole images without paired panoramic views, alleviating distortion gaps.

Second, DGA assigns pinhole-like (pin-like) and panoramic-like (pan-like) features to each image by gating, and aligns these two features through uncertainty estimation, reducing texture gaps.

;Comment: Accepted by Information Fusion 2025

Jiang, Jing,Zhao, Sicheng,Zhu, Jiankun,Tang, Wenbo,Xu, Zhaopan,Yang, Jidong,Liu, Guoping,Xing, Tengfei,Xu, Pengfei,Yao, Hongxun, 2024, Multi-source Domain Adaptation for Panoramic Semantic Segmentation