Document details
Identifier

oai:arXiv.org:2402.17133

Subject
Computer Science - Computer Vision...
Authors
Wang, Chengcheng; Hao, Zhiwei; Tang, Yehui; Guo, Jianyuan; Yang, Yujie; Han, Kai; Wang, Yunhe
Category

Computer Science

Year

2024

Date indexed

6 March 2024

Keywords
sr diffusion-based models process noise diffusion
Abstract

Diffusion-based super-resolution (SR) models have recently garnered significant attention due to their potent restoration capabilities.

Conventional diffusion models, however, sample noise from a single distribution, which limits their ability to handle real-world scenes and complex textures across semantic regions.

Following the success of the Segment Anything Model (SAM), sufficiently fine-grained region masks can now be generated to enhance the detail recovery of diffusion-based SR models.

However, directly integrating SAM into SR models results in a much higher computational cost.

In this paper, we propose the SAM-DiffSR model, which uses fine-grained structural information from SAM when sampling noise to improve image quality without additional computational cost during inference.

During training, we encode structural position information into the segmentation mask from SAM.

The encoded mask is then integrated into the forward diffusion process by using it to modulate the sampled noise.

This adjustment allows us to independently adapt the noise mean within each corresponding segmentation area.

The diffusion model is trained to estimate this modulated noise.
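
The training-time steps described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `encode_mask` and `forward_diffuse`, the centroid-based encoding, the `scale` parameter, and the additive form of the modulation are all assumptions made for illustration. The abstract states only that the encoded mask adapts the noise mean per segment and that the model is trained to estimate the modulated noise.

```python
import numpy as np


def encode_mask(segment_ids, scale=0.1):
    """Hypothetical position encoding (an assumption, not the paper's scheme).

    Each segment receives one scalar derived from its normalized centroid,
    so every pixel in a segment shares the same noise-mean offset.
    """
    h, w = segment_ids.shape
    ys, xs = np.mgrid[0:h, 0:w]
    encoded = np.zeros((h, w), dtype=np.float64)
    for seg in np.unique(segment_ids):
        region = segment_ids == seg
        # Centroid coordinates normalized to [0, 1].
        cy = ys[region].mean() / max(h - 1, 1)
        cx = xs[region].mean() / max(w - 1, 1)
        # Center the offset around zero and keep it small.
        encoded[region] = scale * (cy + cx - 1.0)
    return encoded


def forward_diffuse(x0, encoded_mask, alpha_bar_t, rng):
    """One forward-diffusion sample with mask-modulated noise.

    Assumes additive modulation: standard Gaussian noise is shifted by the
    encoded mask, which changes the noise mean inside each segment.
    Returns the noisy image and the modulated noise (the training target).
    """
    eps = rng.standard_normal(x0.shape)
    eps_mod = eps + encoded_mask
    x_t = np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps_mod
    return x_t, eps_mod


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    segment_ids = np.zeros((8, 8), dtype=int)
    segment_ids[:, 4:] = 1  # two toy segments: left half and right half
    x0 = rng.uniform(-1.0, 1.0, size=(8, 8))
    x_t, target = forward_diffuse(x0, encode_mask(segment_ids), 0.5, rng)
    print(x_t.shape, target.shape)
```

In this sketch the mask enters only the forward (training) side: a denoiser would be trained to regress `target`, which is consistent with the abstract's statement that SAM is not needed at inference.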

Crucially, our proposed framework does NOT change the reverse diffusion process and does NOT require SAM at inference.

Experimental results demonstrate the effectiveness of the proposed method, which suppresses artifacts better than existing diffusion-based methods and surpasses them by up to 0.74 dB in PSNR on the DIV2K dataset.
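
For readers unfamiliar with the metric, PSNR can be computed as in the short sketch below. It uses the standard definition, 10 · log10(MAX² / MSE). The function name `psnr` and the 8-bit default `max_value=255.0` are assumptions for illustration, and the evaluation protocol actually used for the reported numbers (color space, border cropping) is not stated in this record.

```python
import numpy as np


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two same-shaped images."""
    reference = np.asarray(reference, dtype=np.float64)
    restored = np.asarray(restored, dtype=np.float64)
    mse = np.mean((reference - restored) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(max_value ** 2 / mse)
```

Because PSNR is logarithmic in the mean squared error, a gain of 0.74 dB corresponds to an MSE that is about 16% lower (10^(-0.074) ≈ 0.84).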

The code and dataset are available at https://github.com/lose4578/SAM-DiffSR.

Wang, Chengcheng; Hao, Zhiwei; Tang, Yehui; Guo, Jianyuan; Yang, Yujie; Han, Kai; Wang, Yunhe, 2024, SAM-DiffSR: Structure-Modulated Diffusion Model for Image Super-Resolution
