ESTRO 2021 Abstract Book
S386
ESTRO 2021
diaphragm (target and OAR surrogates respectively). Motion traces were reported for the lead tip in the three cardinal axes and in the sup-inf axis for the diaphragm. Fourier analysis subsequently allowed respiratory motion to be separated from cardiac motion. Validation of this process was carried out using a ball-bearing in motion on a moveable platform. Data tracking accuracy was assessed by manual identification of positions compared with automatic tracking. Data from three patients treated at our centre were analysed for cardiac and respiratory motion. Results Phantom validation measurements showed that motion was quantified to within 0.5 mm for motion amplitudes over a clinically relevant range. For the three patients included in this study, lead tip displacement was measured to be on average (± standard deviation) 9.5 ± 2.1 mm (left-right), 9.7 ± 1.8 mm (ant-post) and 12.9 ± 2.7 mm (sup-inf). Cardiac motion alone was shown to provide approximately two thirds of the total motion (see table). Diaphragm positions were between 11 and 19 mm measured as average peak-to-trough values. The accuracy of lead tip tracking was > 82% and diaphragm tracking was > 94% for each of the patients.
Conclusion A novel technique for measuring motion in the hearts of patients undergoing radiotherapy has been developed and validated. A three-patient sample has reported results in-line with motion data measured by other authors using alternative measurement approaches. Characterisation of motion for radiotherapy treatment planning and quality assurance of patient set-up have been shown to be possible for cardiac SABR using routinely acquired cone-beam CT images. OC-0504 Quality Assurance and Clinical Acceptability for AI-driven Automatic Contouring of Organs at Risk N. Dissler 1 , S. Stathakis 2 , A. Lombard 3 , N. Paragios 4 , G. Klausner 5 , L. Lahmi 5 , W. Jones III 6 , E. Maani 6 1 TheraPanacea, Functional & Clinical Specifications, Paris, France; 2 MD Anderson Cancer Center, Mays Cancer Center, Radiation Oncology and Radiology, San Antonio, TX, USA; 3 TheraPanacea, Machine Learning Research & Development, Paris, France; 4 TheraPanacea, President & Chief Executive Officer, Paris, France; 5 Université de médecine Pierre et Marie Curie, Paris Sorbonne Université, Radiation Oncology and Radiology, Paris, France; 6 South Texas Veterans Health Affairs, Radiation Oncology, San Antonio, TX, USA Purpose or Objective The aims of this study are three-fold: (i) to evaluate the generalization properties of automatic contouring solution across different annotation guidelines (ESTRO for training, ASTRO for testing), (ii) to compare the precision between experts’ annotations (inter-variability) and the ones obtained by AI solution (iii) to evaluate the relevance of commonly adopted quality assurance metrics with respect to the clinical reality. Materials and Methods An automatic contouring CE/FDA-cleared solution - based on an ensemble deep learning models trained for contouring 100+ organs at risk with ESTRO guidelines on +25,000 patients after anatomically preserving data augmentation – is blindly evaluated on ~140 ASTRO guidelines patients coming from the Mays Cancer Center (60%) and the cancer imaging archive (40%) for pelvis, abdomen head & neck and brain anatomies. In addition, a distinct set of 60+ patients from the pelvis and head & neck anatomies is retrospectively blindly annotated from a second qualified radiation oncologist. Results The absence of ASTRO-contoured patients in training doesn’t seem to impact the generalization of the trained models to the ASTRO treated population cohort in terms of Dice coefficient (DSC) and Hausdorff distance (HD) (with marginal consistent improvement on the ASTRO patient cohort). Generalization is demonstrated by an average DSC among 34 organs of 0.74, higher value of 0.97 for the lungs and lower value of 0.31 for the right brachial plexus. In terms of acceptability, DSC score which is the most commonly adopted and recommended metric fails short to depict the complexity of the objective. On top of the fact that the measure is biased towards organs with significant volume, it was observed a clear relationship between the observed DSC and the volume of the organs on all anatomies (except for bone structures). Finally, inter-expert variability (average DSC among 9 organs of 0.54, higher value of 0.82 for the rectum and lower value of 0.13 for the penile bulb) seems to follow in terms of DSC discrepancies the same trend of behavior as the one observed for the automatic contouring solution while in average showing constantly lower similarities than the ones observed between the automatic solution and the expert’s annotation (see Figure 1).
Made with FlippingBook Learn more on our blog