Dokumentdetails
ID

oai:arXiv.org:2410.07753

Thema
Computer Science - Computer Vision... Computer Science - Machine Learnin...
Autor
Venkatesh, Danush Kumar Rivoir, Dominik Pfeiffer, Micha Kolbinger, Fiona Speidel, Stefanie
Kategorie

Computer Science

Jahr

2024

Auflistungsdatum

27.11.2024

Schlüsselwörter
scene computer organs generated multi-class organ
Metrisch

Zusammenfassung

In computer-assisted surgery, automatically recognizing anatomical organs is crucial for understanding the surgical scene and providing intraoperative assistance.

While machine learning models can identify such structures, their deployment is hindered by the need for labeled, diverse surgical datasets with anatomical annotations.

Labeling multiple classes (i.e., organs) in a surgical scene is time-intensive, requiring medical experts.

Although synthetically generated images can enhance segmentation performance, maintaining both organ structure and texture during generation is challenging.

We introduce a multi-stage approach using diffusion models to generate multi-class surgical datasets with annotations.

Our framework improves anatomy awareness by training organ specific models with an inpainting objective guided by binary segmentation masks.

The organs are generated with an inference pipeline using pre-trained ControlNet to maintain the organ structure.

The synthetic multi-class datasets are constructed through an image composition step, ensuring structural and textural consistency.

This versatile approach allows the generation of multi-class datasets from real binary datasets and simulated surgical masks.

We thoroughly evaluate the generated datasets on image quality and downstream segmentation, achieving a $15\%$ improvement in segmentation scores when combined with real images.

The code is available at https://gitlab.com/nct_tso_public/muli-class-image-synthesis ;Comment: Accepted at WACV 2025

Venkatesh, Danush Kumar,Rivoir, Dominik,Pfeiffer, Micha,Kolbinger, Fiona,Speidel, Stefanie, 2024, Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models

Dokumentieren

Öffnen

Teilen

Quelle

Artikel empfohlen von ES/IODE AI

Skin cancer prevention behaviors, beliefs, distress, and worry among hispanics in Florida and Puerto Rico
skin cancer hispanic/latino prevention behaviors protection motivation theory florida puerto rico variables rico psychosocial behavior response efficacy levels skin cancer participants prevention behaviors spanish-preferring tampeños puerto hispanics