ESTRO 2024 - Abstract Book
S4454
Physics - Machine learning models and clinical applications
ESTRO 2024
Threshold level
Sensitivity
-
Specificity
-
Accuracy - Test Data [%]
Sensitivity - Test Data [%]
Specificity - Test Data [%]
Accuracy - Test Data [%]
Test Data [%]
Test Data [%]
0.5 0.6 0.7
81.25 86.67 85.71 84.62
100 100
83.33 88.89 83.33 77.78 77.78
76.47 81.25 92.31 92.31
100 100
77.78 83.33 88.89 88.89 88.89 72.22
92.31 84.62 69.23 30.77
92.31 92.31 84.62 61.54
0.75
0.8 0.9
100 100
100 100
50
We selected a conservative threshold level = 0.8. Anyway, a threshold = 0.75 could intercept all plans with gamma <85% for both models (Figure 2).
For XGBoost, we evaluated model stability by selecting 10 different random seeds. Results averaged over 10 random seeds (threshold level = 0.8) are AUC 10 = 0.8±0.1, Sensitivity 10 = 91±3.9, Specificity 10 = 78.5±9, Accuracy 10 = 78.9±7.4. XGBoost could correctly identify all plans with gamma <85% for all random seeds analysed.
Conclusion:
Made with FlippingBook - Online Brochure Maker