ABSTRACT
PURPOSE
Knowing the genetic phenotype of gastrointestinal stromal tumors (GISTs) is essential for patients who receive therapy with tyrosine kinase inhibitors. The aim of this study was to develop a radiomic algorithm for predicting GISTs with KIT exon 11 mutation.
METHODS
We enrolled 106 patients (80 in the training set, 26 in the validation set) with clinicopathologically confirmed GISTs from two centers. Preoperative and postoperative clinical characteristics were selected and analyzed to construct the clinical model. Arterial phase, venous phase, delayed phase, and tri-phase combined radiomics algorithms were generated from the training set based on contrast-enhanced computed tomography (CE-CT) images. Various radiomics feature selection methods were used, namely least absolute shrinkage and selection operator (LASSO); minimum redundancy maximum relevance (mRMR); and generalized linear model (GLM) as a machine-learning classifier. Independent predictive factors were determined to construct preoperative and postoperative radiomics nomograms by multivariate logistic regression analysis. The performances of the clinical model, radiomics algorithm, and radiomics nomogram in distinguishing GISTs with the KIT exon 11 mutation were evaluated by area under the curve (AUC) of the receiver operating characteristics.
RESULTS
Of 106 patients who underwent genetic analysis, 61 had the KIT exon 11 mutation. The combined radiomics algorithm was found to be the best prediction model for differentiating the expression status of the KIT exon 11 mutation (AUC = 0.836; 95% confidence interval [CI], 0.640-0.951) in the validation set. The clinical model, and preoperative and postoperative radiomics nomograms had AUCs of 0.606 (95% CI, 0.397-0.790), 0.715 (95% CI, 0.506-0.873), and 0.679 (95% CI, 0.468-0.847), respectively, with the validation set.
CONCLUSION
The radiomics algorithm could distinguish GISTs with the KIT exon 11 mutation based on CE-CT images and could potentially be used for selective genetic analysis to support the precision medicine of GISTs.
Main points
• Distinguishing the genetic phenotype of gastrointestinal stromal tumors (GISTs) is important for patients who receive targeted therapy with tyrosine kinase inhibitors.
• In the present study, we developed new arterial phase, venous phase, and delayed phase radiomics algorithms, a combined radiomics algorithm, preoperative and postoperative radiomics nomograms based on tri-phase contrast-enhanced CT.
• As a noninvasive tool, the combined radiomics algorithm facilitates the best prediction performance effectively.
• The combined radiomics algorithm achieves a satisfactory overall diagnostic accuracy for predicting the GISTs with KIT exon 11 mutation, showing an AUC of 0.836 (95% CI, 0.640-0.951) in the validation set.
Gastrointestinal stromal tumors (gISTs) are the most commonly diagnosed subepithelial tumors of the gastrointestinal tract.1-3 However, gISTs respond poorly to chemotherapy and prognosis prediction is challenging because of their biological diversity and histological heterogeneity.2 mutations in the KIT gene, which encodes a tyrosine kinase, are known to lead to the development of gISTs. Besides KIT, mutations in other genes (e.g., platelet-derived growth factor receptor alpha, PDGFRA) have also been associated with gISTs.4
Most gISTs (75%-85%) have KIT mutations, 5%-7% have PDGFRA mutations, and 12%15% have no detectable mutation in KIT or PDGFRA.1, 4 The national comprehensive cancer network (nccn) guidelines highly recommend the genetic analysis of gISTs, because the expression status of specific regions of the KIT and PDGFRA gene sequences are correlated with a specific response to tyrosine kinase inhibitors (TKIs).2 The KIT exon 11 mutation, which accounts for about 75% of KIT mutations in gISTs, is the most common imatinib-target mutation and gISTs with this mutation respond well to standard imatinib therapy.1, 5-7 Because there are a fairly large number of patients who do not have the KIT exon 11 mutation, genetic analysis is required to define the wide variety of treatment effects.2, 6, 7 Therefore, before clinical treatment with TKIs, it is important to determine the expression status of the KIT exon 11 mutation. However, considering tolerance of the procedure and the heterogeneity of gISTs, conventional invasive biopsy or surgery to obtain specimens has many defects.2, 8, 9 Further, in our two centers, pathological specimens were obtained through biopsy and surgery, and routine pathological data such as mitotic count were analyzed. But patients still need to spend extra to get a genetic analysis of gISTs.10, 11 consequently, it is urgent to establish a noninvasive, repeatable, economical, and accurate pretreatment method to identify the expression status of the KIT exon 11 mutation.
Contrast-enhanced computed tomography (CE-CT) is a basic diagnostic method that is used to detect lesions and evaluate the treatment response of gISTs 10-12. medical imaging has been playing an increasing key role in personalized precision medicine. Radiomics has been applied to oncology studies13 and has been widely used in diagnosis,14 prognostic evaluation,15 prediction of biological behaviors,16 and even genetic prediction.11, 17 Previous studies have shown that gene mutation typing of gISTs is related to qualitative imaging features18-20 and clinicopathological characteristics21-24, so we included preoperative clinical and postoperative routine pathological dates as predictive factors in this radiomics study. As far as we know, there are still relatively few radiomics studies to predict the genotypes of gISTs.11 Thus, the aim of this two-center study was to develop and validate a triphase ce-cT based radiomics model that can distinguish the patients with the KIT exon 11 mutation.
Methods
Study Design and Patient Population
In this retrospective study, we screened the patients with pathologically confirmed gISTs in the databases of these institutions from July 2008 to August 2019. Patients who fulfilled the following inclusion criteria were included in the study: a) patients with pathological specimens through biopsy or surgery for genetic analysis results; b) complete clinical characteristics; c) tri-phase cecT images before treatment (neoadjuvant therapy or surgery); and d) gIST size ≥2 cm and could be outlined in the ce-cT images for volume of interest segmentation.
According to these criteria, 106 patients who also underwent genetic analysis between August 2010 and August 2019 were included: 93 were from Affiliated Hospital of Qingdao University (Institution 1) and 13 were from Shandong Provincial Hospital Affiliated to Shandong First medical University (Institution 2). Because the number of patients from institution 2 was small, their data could not be used as an external validation set alone. Therefore, the data of patients from both institutions were merged, then randomly allocated to training and validation sets in a 3:1 ratio. The number of patients with the KIT exon 11 mutation was 61, the number of patients without the KIT exon 11 mutation was 45. Among the 106 patients, 57 were male and 49 were female (mean age 59.5 ± 9.9 years).
Ethical statement
The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. This study was approved by both institutions' review boards (Approval number: QYFYWZLL25829) and the need for informed consent from patients was waived.
Tri-phase CE-CT image screening and volume of interest segmentation
All 106 patient had undergone ce-cT scanning with 16 or 64-detector spiral cT scanners (Siemens Somatom Definition, ge Hispeed, ge Bright, or ge optima 670). For all these measurements, the tube current was 200-240 mAs, the slice thickness was 5.0 mm, the tube voltage was 120-160 kV, and the injection rate was 3.5 mL/s, with intravenous injection of contrast (iopromide, 80 mL) and arterial, venous, and delayed phase with 30, 70, and 300 s delay, respectively.
ITK-SnAP (v3.6.0; www.itksnap.org) was used for three-dimensional volume of interest (VoI) segmentation of the ce-cT images. VoI segmentation and feature extraction were performed in a blind manner by two radiologists. Radiologist 1, who had nine years’ clinical experience, outlined the edge of the tumor on the tri-phase ce-cT images layer by layer and fused them into the VoIs. The same radiologist repeated the VoI segmentation and feature extraction after a two week wash out period. In addition, radiologist 2, who had 14 years’ clinical experience, performed the VoI segmentation on the all ce-cT images and feature extraction once. Inter- and intraclass correlation coefficients (Iccs) were used to evaluate the reproducibility and stability of radiomics feature extraction to identify robust radiomics features. We randomly selected 30 cases of VoI segmentation performed by radiologist 1 and 2 for the Iccs. The interobserver Icc was derived from the first feature extraction by radiologist 1 and radiologist 2. The intraobserver Icc was derived from the feature extraction performed twice by radiologist 1. generally, Iccs >0.75 are considered to indicate good reproducibility or reliability. In this study, to ensure highly accurate features were selected, an Icc >0.9 was used as a criterion for good robustness. The radiologists were blinded to the mutational analysis.
Image normalization and feature extraction
Some preprocessing was needed to improve texture recognition before feature extraction. First, the VoI was normalized by the μ±3σ to highlight the differences between the classes.25 Second, in order to reduce the computation time and improve the SnR of the texture results, the image was processed by using gray-level quantization (reducing the grayscale used to represent the image).26 Finally, cubic interpolation was used to isotopically resample the VoIs to the in-plane resolution (0.5×0.5 mm) to ensure that the proportion and direction of the acquired three-dimensional features remain unchanged.27
The radiomics features of preprocessed cT images were extracted using commercial software (AnalysisKIT v.: 1.0.3; ge Healthcare). The software can process the image texture features and quantitatively analyze the heterogeneity within the lesion. For each phase ce-cT images, 4 types of features and 396 quantitative variables (10 Haralick feature, 42 Histogram features, 9 morphological feature, 11 gLSZm features, 144 gLcm features and 180 RLm features) were extracted. The workflow process for radiomics processing and analysis is presented in Figure 1.
Combat compensation method
The distinguishing features of the texture patterns were retained using the combat compensation method,28 while eliminating the influence of the scanner and the protocol. This method was helpful for multicenter radiomics analysis29 and can be used for cecT images.
Clinical characteristics
The preoperative demographic and clinicopathologic data, including age, sex, and blood lab results (e.g., carcinoembryonic antigen (ceA), carbohydrate antigen 199 [cA199]), were collected for the 106 patients. Postoperative pathological data, including the location and size (maximum diameter) of the tumor, proteomics (e.g., expression statuses of tumor related proteins Ki-67, cD117, cD34, and Dog1), mitotic count, national Institutes of Health (nIH) risk stratification, growth pattern, and genetic phenotype were also collected.
For KIT and PDGFRA gene mutation analysis, a DnA extraction KIT was used to extract DnA from pathological tissues, and primer design and PcR amplification was performed.
Clinical model, radiomics algorithm, and radiomics nomogram building
The relationship between clinical factors and the KIT exon 11 mutation was assessed by the univariate logistic regression. Then, the multivariate logistic model was used with the significant clinical features (P < .05) to develop the clinical model.
The feature extraction algorithm implemented in the R statistical software (v3.6.3; www.Rproject.org) was used to select features and generate each phase and tri-phase combined radiomics algorithm. During the algorithm construction, only the features with Iccs >0.9 were included. To select the most valuable predictive factors in the training set, we incorporated each phase and tri-phase ce-cT’s the top 20 features of minimum redundancy maximum relevance (mRmR) into the least absolute shrinkage and selection operator method (LASSo) and fitted for regression of data. generalized linear model (gLm) was used in this study, which is a machine-learning classifier and was trained with the training set using the 10-fold cross-validation approach. A radiomics algorithm was added with preoperative and postoperative clinical factors to build nomograms with multivariable logistic regression analysis. In the research design process, we considered that even if pathological specimens are obtained, routine genetic analysis of gIST has not been extensively performed due to the high cost, so we decided to make full use of the preoperative and postoperative data to construct the preoperative and postoperative model.
Assessment and validation of the clinical model, radiomics algorithm, and radiomics nomogram
The effectiveness of the clinical model, radiomics algorithm, and radiomics nomogram in differentiating, calibration, and clinical value was verified with the validation set. The predictive capability was evaluated using the validation set by the receiver operating characteristic (Roc) curve, the area under the Roc curve (AUc), specificity, sensitivity, accuracy, and positive and negative predictive values. The calibration curves were used to assess the agreement between the predicted and actual results with the Hosmer–Lemeshow test.30 The clinical applicability of the radiomics algorithm and preoperative and postoperative radiomics nomograms also was assessed with decision curve analysis by calculating the net benefit at different threshold probabilities.31
Statistical analysis
R statistical software (v3.6.3; www.Rproject.org) was used for all the statistical analyses. To select the clinical factors, Fisher’s exact test or the chi-square test was used to compare the different groups of continuous variables in clinical characteristics. mann-Whitney U test or independent t-test was used to evaluate the discrimination in clinical characteristics among different groups of continuous variables. Logistic regression was used to develop the clinical model and radiomics nomograms.
Results
Patients with gISTs were divided into two groups: those without the KIT exon 11 mutation and those with the KIT exon 11 mutation. The clinicopathologic and demographic data of these patients are summarized in Table 1. The expression status of cA724 and cD117 were significantly different between the two groups. no significant differences were found between any of the other clinical characteristics.
Because the ratio of the number of cD117 (-) (n=0) and (+) (n=61) in gISTs with the KIT 11 mutation is extremely asymmetric in the training set, we did not put this factor into the logistic regression in order not to produce meaningless results and errors. The univariate logistic regression analysis of the clinical results showed that cA724 was significantly associated with gISTs with the KIT 11 mutation in the training set (Table 2). The multivariate logistic regression analysis of the clinical characteristics showed that cA724 (P = .004) and mitotic count (P = .020) were independent predictors of the KIT exon 11 mutation in gISTs (Table 2). We constructed preoperative and postoperative clinical models on the multivariate logistic regression results. The performances of the preoperative (cA724) and postoperative (cA724 and mitotic count) clinical models in distinguishing gISTs with the KIT exon 11 mutation are shown in Table 3. For the preoperative clinical model with the training and validation sets, the AUcs were 0.669 and 0.509 respectively; for the postoperative clinical model with the training and validation sets, the AUcs were 0.770 and 0.606 respectively.
Through the above tri-phase ce-cT data processing, we developed arterial phase (A-phase), venous phase (V-phase), delayed phase (D-phase), and tri-phase combined radiomics algorithms. Patients without the KIT exon 11 mutation generally had lower radiomics scores than patients with the KIT exon 11 mutation. The selected radiomics features of the A-phase, V-phase, D-phase, and combined radiomics algorithms are shown in Figure 2. The prediction efficiencies of the radiomics algorithms to distinguish gISTs with the KIT exon 11 mutation in the training and validation sets are shown in Table 4. The AUcs of the radiomics algorithms with the training and validation sets were 0.773 and 0.709, for the A-phase, 0.745 and 0.558 for the V-phase, 0.738 and 0.600 for the D-phase, and 0.826 and 0.836 for the combined radiomics algorithms, respectively (Figure 3). These results show that the predictive performance of the tri-phase combined radiomics algorithm was better than that of any of the single-phase algorithms.
The preoperative and postoperative clinical models and combined radiomics algorithm were incorporated into the multivariate logistic regression analysis to generate preoperative and postoperative radiomics nomograms, respectively (Figure 4a, 4d). The performances of preoperative and postoperative radiomics nomograms in distinguishing the patients with the KIT exon 11 mutation in the training and validation sets are shown in Table 3. For the preoperative radiomics nomogram with the training and validation sets, the AUcs were 0.913 and 0.715, respectively; for postoperative radiomics nomogram with the training and validation sets, the AUcs were 0.919 and 0.679, respectively. The Roc curves for the training and validation sets were used to compare the prediction efficiencies of the combined radiomics algorithm and preoperative and postoperative radiomics nomograms (Figure 3). Interestingly, with the training set, the preoperative and postoperative radiomics nomograms both had higher AUcs than the combined radiomics algorithm, whereas, with the validation set, the combined radiomics algorithm had a higher AUc than the three models.
The calibration curves of the combined radiomics algorithm (Figure 5c, 5d; P = .781 and .534) and preoperative (Figure 4b, 4c; P = .838 and .730) and postoperative (Figure 4e, 4f; P = .899 and .495) radiomics nomograms showed good calibration with the nonsignificant Hosmer–Lemeshow test statistic with the training and validation sets respectively. The decision curve analysis (DcA) showed that the combined radiomics algorithm produced the highest overall net benefit compared with that of the preoperative or postoperative radiomics nomograms in predicting the KIT 11 mutation in the validation set across most of the range of reasonable threshold probabilities. However, the DcA found opposite results with the training set (Figure 5a, 5b).
Discussion
The specific KIT 11 mutation is associated with the clinical response to targeted therapy of gISTs.1, 2, 5 Selecting the targeted therapies based on the gIST genotypes has made precision medicine for gISTs possible. Radiogenomics approaches decode image phenotypes using radiomics to predict the genotypes of tumors, and they have been applied successfully in precision medicine studies.11, 17 In this retrospective two-center study, the aim was to develop a personalized radiomics method that could distinguish gISTs with the KIT exon 11 mutation by tri-phase ce-cT. In order to adapt to more diverse patients, flexibly use preoperative and postoperative data to develop clinical models, A-phase, V-phase, D-phase, and tri-phase combined radiomics algorithms, and preoperative and postoperative radiomics nomograms. The results showed that the combined radiomics algorithm had a well-calibrated clinical benefit and the best ability to distinguish gISTs with the KIT exon 11 mutation in the validation set, which indicated that radiomics had accurate and reproducible value for predicting gISTs with the KIT exon 11 mutation genotype.
The clinical model was used to predict the biological behaviors of gISTs,32 and genotypes have been shown to be closely related to certain clinical characteristics.21, 22, 33 However, the relationship between some clinical characteristics and the KIT 11 mutation remains controversial.21, 23, 24, 33 most of the previous studies showed that gISTs with the KIT exon 11 mutation usually had high percentages of mitotic count and/or poor prognosis.21, 24, 33 The relationship between cA724 and gIST genotypes has not been reported until now. We found that, in the training set, the mitotic count and cA724 were correlated with the gISTs with the KIT exon 11 mutation. However, the preoperative and postoperative clinical models had relatively lower predictive effectiveness (AUc= 0.509 and 0.606) with the validation set than all the other models tested. This result indicated that the models based solely on clinical characteristics made only limited contributions to the discrimination performance. The roles of mitotic count and cA724 for differentiating the expression status of KIT exon 11 need further studies.
The extraction of high-dimensional features from medical images by radiomics has emerged as a promising approach for predicting the characteristics of tumors.11, 13, 15, 17 Radiomics is more comprehensive than morphological visual analysis and can quantitatively reveal tumor heterogeneity. Various types of medical images can be analyzed to help with diagnosis and treatment and support traditional diagnostic methods.14, 16, 34, 35 Previous studies have shown that gene mutation typing of gISTs is related to qualitative imaging features,18-20 the gISTs with the KIT exon 9 mutation were associated with a higher enhancement ratio and a tumor size larger than 10 cm when compared with those with the KIT exon 11 mutation.18 Hyperenhancement of cT was more frequent in gISTs without KIT mutations compared to gISTs with KIT mutations.20 considering that gISTs of different genotypes may have heterogeneous blood supply and different ce-cT enhancement ratio.18, 20 Using VoIs of all available slices extracted from tri-phase ce-cT images to construct a combined radiomics algorithm11 can provide more information about a tumor,36 and may have better clinical application value and generalizability in discriminating mutation.36-38 In the present study, we extracted more advanced radiomics features and used more data from two institutions compared with the cT texture analysis and single-center data in a previous study.11 We also used various evaluation tools, better radiomics feature selection methods and machine-learning classifier to obtain the generalization and accuracy of the predictive model.
The radiomics algorithm was proven to be more effective than the clinical model. next, we incorporated the clinical factors with the combined radiomics algorithm to build preoperative and postoperative radiomics nomograms. The AUc and DcA of the combined radiomics algorithm were compared with those of the preoperative and postoperative radiomics nomograms. The results showed that the combined radiomics algorithm had better predictive effectiveness and clinical applicability with the validation set than the radiomics nomograms, which indicated that the radiomics algorithm reflected molecular-level pathology better than the clinical factors and confirmed the enormous potential of the radiomics algorithm to distinguish gISTs with the KIT 11 mutation.
To prevent overfitting, the mRmR and LASSo regression methods were applied to the extracted features. For feature selection, we used mRmR, which is a new feature screening method,34 to screen radiomics features that have the fewest redundancies and most credible coefficients. LASSo is a feature screening method for selecting the predominant features of both ridge regression and subset screening.39 We used gLm as a machine-learning classifier because it was reported to produce the best screening performance among the 11 machine-learning methods that were tested.35 With application of the excellent feature selection methods and machine-learning classifier, the combined radiomics algorithm satisfactorily distinguished gISTs with and without the KIT exon 11 mutation.
The development of targeted therapy and genetic analysis have made the precision medicine feasible for gISTs.5, 11, 22, 33 The most common imatinib-target mutation is the KIT exon 11 mutation, and patients with this mutation can benefit from standard imatinib therapy.2, 4, 5, 10 The relatively rare patients without the KIT exon 11 mutation require further genetic analysis to identify multiple effects of different treatments.1, 2, 4-6 The combined radiomics algorithm developed in this study may provide a supplement to biopsies, which also have many challenges,2, 8, 10 to distinguish gISTs with the KIT exon 11 mutation noninvasively. For patients who were evaluated as the KIT exon 11 mutation of high probability with the combined radiomics algorithm, clinicians could recommend the standard imatinib therapy.
Our study has several limitations. First, all data were collected retrospectively and limited to chinese patients, so selection bias was inevitable. Second, although the data were from two centers, the proportion of samples that had been performed genetic analysis was small, possibly because of cost considerations, which highlights the need for and significance of radiomics analysis. These findings need to be validated in international, larger multi-institutional studies. Third, only ce-cT images (slice thickness of 5.0 mm) were analyzed in this study. Whether radiomics could achieve a better performance on different slice thickness of ce-cT is worth further investigation. Fourth, the hand-crafted feature requires time-consuming tumor boundary segmentation, deep learning radiomics could accurately and automatically detect and segment and achieve excellent performances.17 Lastly, this study includes radiomic features from the tri-phase ce-cT. Due to difference in protocols between different institutions, the delayed phase is not always performed in gIST’s staging, which may limit the general applicability of the models.
In conclusion, we developed and validated several predictive models that could noninvasively distinguish gISTs with KIT exon 11 mutation based on tri-phase ce-cT images. The combined radiomics algorithm performed better than the other predictive models, which confirmed it as a potential predictor to supplement conventional approaches for selective genetic analysis and to support the clinical decision about imatinib therapy in the precision medicine of gISTs.
Discussion
The specific KIT 11 mutation is associated with the clinical response to targeted therapy of gISTs.1, 2, 5 Selecting the targeted therapies based on the gIST genotypes has made precision medicine for gISTs possible. Radiogenomics approaches decode image phenotypes using radiomics to predict the genotypes of tumors, and they have been applied successfully in precision medicine studies.11, 17 In this retrospective two-center study, the aim was to develop a personalized radiomics method that could distinguish gISTs with the KIT exon 11 mutation by tri-phase ce-cT. In order to adapt to more diverse patients, flexibly use preoperative and postoperative data to develop clinical models, A-phase, V-phase, D-phase, and tri-phase combined radiomics algorithms, and preoperative and postoperative radiomics nomograms. The results showed that the combined radiomics algorithm had a well-calibrated clinical benefit and the best ability to distinguish gISTs with the KIT exon 11 mutation in the validation set, which indicated that radiomics had accurate and reproducible value for predicting gISTs with the KIT exon 11 mutation genotype.
The clinical model was used to predict the biological behaviors of gISTs,32 and genotypes have been shown to be closely related to certain clinical characteristics.21, 22, 33 However, the relationship between some clinical characteristics and the KIT 11 mutation remains controversial.21, 23, 24, 33 most of the previous studies showed that gISTs with the KIT exon 11 mutation usually had high percentages of mitotic count and/or poor prognosis.21, 24, 33 The relationship between cA724 and gIST genotypes has not been reported until now. We found that, in the training set, the mitotic count and cA724 were correlated with the gISTs with the KIT exon 11 mutation. However, the preoperative and postoperative clinical models had relatively lower predictive effectiveness (AUc= 0.509 and 0.606) with the validation set than all the other models tested. This result indicated that the models based solely on clinical characteristics made only limited contributions to the discrimination performance. The roles of mitotic count and cA724 for differentiating the expression status of KIT exon 11 need further studies.
The extraction of high-dimensional features from medical images by radiomics has emerged as a promising approach for predicting the characteristics of tumors.11, 13, 15, 17 Radiomics is more comprehensive than morphological visual analysis and can quantitatively reveal tumor heterogeneity. Various types of medical images can be analyzed to help with diagnosis and treatment and support traditional diagnostic methods.14, 16, 34, 35 Previous studies have shown that gene mutation typing of gISTs is related to qualitative imaging features,18-20 the gISTs with the KIT exon 9 mutation were associated with a higher enhancement ratio and a tumor size larger than 10 cm when compared with those with the KIT exon 11 mutation.18 Hyperenhancement of cT was more frequent in gISTs without KIT mutations compared to gISTs with KIT mutations.20 considering that gISTs of different genotypes may have heterogeneous blood supply and different ce-cT enhancement ratio.18, 20 Using VoIs of all available slices extracted from tri-phase ce-cT images to construct a combined radiomics algorithm11 can provide more information about a tumor,36 and may have better clinical application value and generalizability in discriminating mutation.36-38 In the present study, we extracted more advanced radiomics features and used more data from two institutions compared with the cT texture analysis and single-center data in a previous study.11 We also used various evaluation tools, better radiomics feature selection methods and machine-learning classifier to obtain the generalization and accuracy of the predictive model.
The radiomics algorithm was proven to be more effective than the clinical model. next, we incorporated the clinical factors with the combined radiomics algorithm to build preoperative and postoperative radiomics nomograms. The AUc and DcA of the combined radiomics algorithm were compared with those of the preoperative and postoperative radiomics nomograms. The results showed that the combined radiomics algorithm had better predictive effectiveness and clinical applicability with the validation set than the radiomics nomograms, which indicated that the radiomics algorithm reflected molecular-level pathology better than the clinical factors and confirmed the enormous potential of the radiomics algorithm to distinguish gISTs with the KIT 11 mutation.
To prevent overfitting, the mRmR and LASSo regression methods were applied to the extracted features. For feature selection, we used mRmR, which is a new feature screening method,34 to screen radiomics features that have the fewest redundancies and most credible coefficients. LASSo is a feature screening method for selecting the predominant features of both ridge regression and subset screening.39 We used gLm as a machine-learning classifier because it was reported to produce the best screening performance among the 11 machine-learning methods that were tested.35 With application of the excellent feature selection methods and machine-learning classifier, the combined radiomics algorithm satisfactorily distinguished gISTs with and without the KIT exon 11 mutation.
The development of targeted therapy and genetic analysis have made the precision medicine feasible for gISTs.5, 11, 22, 33 The most common imatinib-target mutation is the KIT exon 11 mutation, and patients with this mutation can benefit from standard imatinib therapy.2, 4, 5, 10 The relatively rare patients without the KIT exon 11 mutation require further genetic analysis to identify multiple effects of different treatments.1, 2, 4-6 The combined radiomics algorithm developed in this study may provide a supplement to biopsies, which also have many challenges,2, 8, 10 to distinguish gISTs with the KIT exon 11 mutation noninvasively. For patients who were evaluated as the KIT exon 11 mutation of high probability with the combined radiomics algorithm, clinicians could recommend the standard imatinib therapy.
our study has several limitations. First, all data were collected retrospectively and limited to chinese patients, so selection bias was inevitable. Second, although the data were from two centers, the proportion of samples that had been performed genetic analysis was small, possibly because of cost considerations, which highlights the need for and significance of radiomics analysis. These findings need to be validated in international, larger multi-institutional studies. Third, only ce-cT images (slice thickness of 5.0 mm) were analyzed in this study. Whether radiomics could achieve a better performance on different slice thickness of ce-cT is worth further investigation. Fourth, the hand-crafted feature requires time-consuming tumor boundary segmentation, deep learning radiomics could accurately and automatically detect and segment and achieve excellent performances.17 Lastly, this study includes radiomic features from the tri-phase ce-cT. Due to difference in protocols between different institutions, the delayed phase is not always performed in gIST’s staging, which may limit the general applicability of the models.
In conclusion, we developed and validated several predictive models that could noninvasively distinguish gISTs with KIT exon 11 mutation based on tri-phase ce-cT images. The combined radiomics algorithm performed better than the other predictive models, which confirmed it as a potential predictor to supplement conventional approaches for selective genetic analysis and to support the clinical decision about imatinib therapy in the precision medicine of gISTs.