Treffer: Improving radiomics-based isocitrate dehydrogenase 1 prediction in glioma patients using semi-supervised machine learning models.
J Digit Imaging. 2013 Dec;26(6):1045-57. (PMID: 23884657)
Front Oncol. 2025 Mar 11;15:1530144. (PMID: 40134593)
J Pers Med. 2024 Mar 07;14(3):. (PMID: 38541029)
Neuro Oncol. 2017 Nov 6;19(suppl_5):v1-v88. (PMID: 29117289)
Clin Cancer Res. 2016 Oct 15;22(20):5079-5086. (PMID: 27185374)
Sci Data. 2017 Sep 05;4:170117. (PMID: 28872634)
Acad Radiol. 2025 Aug;32(8):4880-4892. (PMID: 40328536)
World Neurosurg. 2019 Jul;127:607-616.e4. (PMID: 30974279)
Cancers (Basel). 2025 May 06;17(9):. (PMID: 40361507)
Biology (Basel). 2024 Oct 30;13(11):. (PMID: 39596840)
AJNR Am J Neuroradiol. 2025 Oct 1;46(10):2098-2106. (PMID: 40194850)
J Neurooncol. 2019 Nov;145(2):257-263. (PMID: 31531788)
Nat Rev Clin Oncol. 2017 Dec;14(12):749-762. (PMID: 28975929)
Magn Reson Imaging. 2023 Dec;104:72-79. (PMID: 37778708)
Sci Data. 2022 Jul 29;9(1):453. (PMID: 35906241)
Eur Radiol. 2022 Jan;32(1):572-581. (PMID: 34255157)
Science. 2018 May 11;360(6389):660-663. (PMID: 29748285)
J Digit Imaging. 2021 Jun;34(3):647-666. (PMID: 33532893)
Diagnostics (Basel). 2024 Nov 05;14(22):. (PMID: 39594139)
Radiol Artif Intell. 2022 Oct 05;4(6):e220058. (PMID: 36523646)
Med Phys. 2020 Dec;47(12):6039-6052. (PMID: 33118182)
Front Oncol. 2022 Oct 06;12:1005805. (PMID: 36276163)
Ann Med Surg (Lond). 2021 Jan 08;62:53-64. (PMID: 33489117)
J Neurooncol. 2015 Jan;121(1):141-50. (PMID: 25205290)
J Neurooncol. 2017 May;133(1):27-35. (PMID: 28470431)
Sci Data. 2022 Jun 14;9(1):338. (PMID: 35701399)
Clin Imaging. 2025 Mar;119:110386. (PMID: 39742798)
Cancers (Basel). 2025 Jan 17;17(2):. (PMID: 39858067)
AJNR Am J Neuroradiol. 2025 Jan 8;46(1):121-128. (PMID: 39779292)
Eur Radiol. 2017 Aug;27(8):3509-3522. (PMID: 28004160)
Neuro Oncol. 2021 Aug 2;23(8):1231-1251. (PMID: 34185076)
Clin Radiol. 2025 Nov;90:107049. (PMID: 40974758)
Neurosurg Rev. 2025 Apr 29;48(1):396. (PMID: 40299088)
Int J Mol Sci. 2021 Sep 26;22(19):. (PMID: 34638714)
EC 1.1.1.42. (IDH1 protein, human)
Weitere Informationen
Background: Determining isocitrate dehydrogenase (IDH) mutation status in glioma is important for determining prognosis. We aimed to compare supervised and semi-supervised machine learning (ML) models in glioma IDH1 mutation status prediction using magnetic resonance imaging (MRI)-derived radiomics features.
Methods: Images and segmentation masks from several public collections, including ACRIN-FMISO, CPTAC-GBM, IvyGAP, TCGA-GBM, TCGA-LGG, UCSF-PDGM, UPENN-GBM, and REMBRANDT, were retrieved from The Cancer Imaging Archive (TCIA) portal. These data were divided into training cohort 1, unlabeled cohort, holdout internal validation (HOIV) cohort, and external validation (EV) cohort. After image preprocessing, radiomics features were extracted from T1-weighted, T1 contrast-enhanced (T1CE), T2-weighted, and fluid-attenuated inversion recovery (FLAIR) sequences. The least absolute shrinkage and selection operator (Lasso) algorithm was used for feature selection. Supervised and semi-supervised models were then constructed using 10 ML algorithms and various sequence combinations. For supervised models, we used training cohort 1 to develop the models. Regarding semi-supervised models, we initially predicted the labels of the unlabeled cohort using the training cohort 1 (pseudolabeling), then concatenated the training cohort 1 with these pseudolabeled data to create training cohort 2, and subsequently developed models using the training cohort 2. Both supervised and semi-supervised models were then validated on HOIV and EV cohorts.
Results: Data for 436, 151, 110, and 535 patients were included in the training cohort 1, unlabeled cohort, HOIV cohort, and EV cohort, respectively. A semi-supervised model using 24 features from T1CE images yielded the highest AUC on EV (0.951), which was significantly higher than the best supervised model (AUC = 0.917, p = 0.005). The latter model was constructed using 30 features from FLAIR and T1CE sequences. Furthermore, across all sequence combinations, the semi-supervised models consistently achieved higher AUCs in the EV cohort.
Conclusion: Semi-supervised approaches may improve the performance of radiomics-based ML models in predicting glioma IDH1 status. Using pseudolabels, these models can increase the size of training data, potentially leading to enhancement of model predictive performance. Additionally, these models may improve prediction efficiency by requiring fewer image sequences.
(© 2025. The Author(s).)
Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: Dr. Shahriar Faghani is the guest editor of the Computer-Aided Diagnosis collection of this journal.