ESTRO 2023 - Abstract Book
S1329
Digital Posters
ESTRO 2023
Conclusion This study illustrated the important inter-observer variability associated with the task of segmenting endoscopy images. The results suggest that ground-truth data should be composed of labels from multiple annotators to avoid bias. Additional data are needed to increase the developed model’s performance and generalizability.
PO-1633 Clinical evaluation of deep learning-based nodal structures segmentation for gynecological cancers
S. Deshpande 1,3,4 , P. Chlap 2,5,6 , R. Finnegan 7,5 , D. Al Mouiee 3,8 , L. Holloway 6,3,4,9,10 , K. Lim 6,4 , V. Do 6,4
1 Liverpool & Macarthur Cancer Therapy Centres, SWSLHD Cancer Services, Liverpool,NSW, Radiation Oncology, Sydney, Australia; 2 UNSW Medicine & Health, South-West Sydney Clinical Campus, UNSW , School of Clinical Medicine, Sydney, Australia; 3 Ingham Institute for Applied Medical Research, Liverpool, NSW, Medical Physics, Sydney, Australia; 4 UNSW Medicine & Health, South-West Sydney Clinical Campus, UNSW, School of Clinical Medicine, Sydney, Australia; 5 Ingham Institute for Applied Medical Research, Medical Physics, Sydney, Australia; 6 Liverpool & Macarthur Cancer Therapy Centres, SWSLHD Cancer Services, Liverpool, NSW, Radiation Oncology, Sydney, Australia; 7 Royal North Shore Hospital, St Leonards, NSW, , Radiation Oncology, Sydney, Australia; 8 UNSW Medicine & Health, South-West Sydney Clinical Campus, UNSW SYDNEY, School of Clinical Medicine, Sydney, Australia; 9 Institute of Medical Physics, University of Sydney, NSW, Medical Physics, Sydney, Australia; 10 University of Wollongong, Wollongong, NSW, Centre for Medical Radiation Physics, Sydney, Australia Purpose or Objective Manual delineation of nodal target structures is time consuming and prone to inter- and intra-observer variability. The Deep Learning (DL) framework nnUNet has shown versatility to adapt to a range of clinical sites and imaging modalities. In this work we investigate the feasibility of using nnUNet for auto-segmentation of clinical structures training on local datasets. The purpose of this study was to evaluate the accuracy and consistency of nodal clinical target volume (CTV) definition derived from DL based auto-segmentation compared with manual delineation for gynecological cancers. Materials and Methods A framework for training, evaluating and deploying auto-segmentation models using nnUNet was implemented. By using nnUnet in our framework, the need for network adaption and hyper-parameter tuning per clinical site is removed. As a use case the automatic delineation of three CTV structures using a dataset of 51 computed tomography (CT) scans from gynecological cancer patients was performed. The dataset was randomly split to use 41 cases for training and 10 for validation. The network was trained to perform multi-label prediction for the internal iliac, common iliac and obturator nodes using four approaches: 2D, 3D low resolution, 3D high resolution, and an ensemble model (which is a composite of the first three methods). Automatic segmentations derived from each of four methods were evaluated against manual contours for the 10 CT validation scans using Dice Similarity Coefficient (DSC), 95th Percentile of Hausdorff Distance (HD95) and mean distance to Agreement (MDA). Each DL segmented contour was reviewed visually by a physicist to assess deviations from the ground truth manual contours and identify the region where this deviation occurs. Results Table 1 summarises the quantitative metrics for DL based auto-segmentation accuracy for the four approaches against the manual segmentation. Our preliminary results were comparable to published data despite a relatively small training dataset. The ensemble model showed incremental benefit over the other three approaches. Figure 1 illustrates the comparison between DL and manual contours with an example of the best and worst performing cases. Most variation between automatic and manual contours was seen in the superior and inferior boundaries.
Made with FlippingBook flipbook maker