ABSTRACT
PURPOSE
To evaluate whether deep learning reconstruction (DLR) can reduce the radiation dose in routine clinical computed tomography (CT) scans compared with iterative reconstruction (IR) while maintaining or improving image quality. The study assesses DLR’s consistency and effectiveness across four distinct CT protocols—chest, head, chest–abdomen–pelvis (CAP) oncology, and lower limb CT angiography (CTA)—representing a wide range of clinical applications.
METHODS
Our study is retrospective and monocentric. It involves a total population of 13,060 patients who underwent a CT scan using either a DLR algorithm (CT-DLR) or an IR algorithm (CT-IR) in one of four different CT acquisition protocols. Image quality was evaluated qualitatively and quantitatively by measuring standardized signal-to-noise ratio and contrast-to-noise ratio values. Assessment was performed on a subsample of 200 patients (25 per protocol per group).
RESULTS
The overall reduction in radiation dose for the CT-DLR group compared with the CT-IR group was approximately 20%. By protocol, dose reductions were 22% for chest CT, 21% for CAP oncology, 20% for lower limb CTA, and 19% for head CT. The CT-DLR group exhibited superior subjective and objective image quality to the CT-IR group.
CONCLUSION
DLR algorithms allow for a significant reduction in radiation dose while achieving higher image quality compared with IR algorithms.
CLINICAL SIGNIFICANCE
This large-scale study confirms that DLR can significantly reduce the radiation dose in routine CT imaging while maintaining or enhancing diagnostic image quality. Its consistent performance across multiple protocols supports broader clinical adoption. Notably, the greatest dose reductions were observed in high-use protocols such as chest and CAP CT, underscoring DLR’s potential to improve both individual patient care and long-term population-level radiation safety.
Main points
• Deep learning reconstruction (DLR) significantly reduces the radiation dose in routine computed tomography (CT) scans compared with iterative reconstruction, with an average reduction of 20%.
• Despite lower radiation, DLR provides consistently better or equivalent image quality across all evaluated protocols: chest CT, head CT, chest–abdomen–pelvis oncology, and lower limb CT angiography.
• These findings support the clinical implementation of DLR as a safe and effective reconstruction method for dose optimization in a wide range of CT applications.
The exponential growth in computational power over the last two decades has enabled substantial improvements in computed tomography (CT) image reconstruction techniques.1 Statistical-based iterative reconstruction (IR) and model-based IR techniques were introduced between 2000 and 2010,2-4 resulting in a major reduction in image noise and the optimization of the signal-to-noise ratio (SNR) and the contrast-to-noise ratio (CNR) for lower irradiation doses,5-8 yet with mixed results regarding pure image quality.
Recently, new image reconstruction methods have been developed using artificial intelligence-based techniques, such as deep learning with convolutional neural networks.9 These neural networks are trained on tens of thousands of paired images to differentiate the noise from the signal more efficiently, resulting in further improvements in image quality10-13 as well as a reduction in the radiation dose delivered.14-17
Although many studies have highlighted the superiority of deep learning reconstruction (DLR) algorithms in terms of image quality compared with previous algorithms, their potential for radiation dose reduction has also been confirmed. Kobayashi et al.18 demonstrated that the implementation of DLR reduced the radiation dose by 45% and lowered the estimated risk of radiation-induced cancer from 0.247% to 0.130%. Although their modeling approach illustrates the theoretical potential of DLR, our study focuses on real-world clinical protocols, revealing a more moderate reduction of 20%. This contrast underscores the complementary value of both perspectives and motivates the present study by addressing routine practice applicability.
The main objective of our study is to estimate the potential radiation dose reduction that can be achieved in a real-life setting using a DLR algorithm compared with an IR algorithm while maintaining equivalent image quality. Our secondary objective is to investigate the variation in radiation dose reduction potential as a function of body mass index (BMI) and patient sex.
Methods
Ethics
Our study was a retrospective analysis of prospectively collected clinical data. The study design was approved by the Ethics Committee of Strasbourg University Hospital (decision number: RNI-2021-149, date: 14.12.2021), and written consent from participating patients was not required.
Population
This is a retrospective, single-center study analyzing prospectively collected dosimetry data from the Dose Archiving and Communication System (DACS) of the diagnostic radiology department of the Nouvel Hôpital Civil (Strasbourg University Hospital) between May 2018 and May 2021. All patients who underwent CT examinations during this period and had their radiation dose data stored on the DACS were considered for inclusion.
Specifically, we included patients who underwent CT examinations on either a third-generation wide-area detector CT scanner equipped with IR software [Aquilion One Genesis, Canon Medical Systems, Ōtawara, Tochigi, Japan, with three-dimensional adaptive iterative dose reduction (AIDR-3D)], which served as the control group (CT-IR), or a fourth-generation wide-area detector CT scanner equipped with DLR [Aquilion One Prism, Canon Medical Systems, with the Advanced intelligent Clear-IQ Engine (AiCE)], which served as the test group (CT-DLR). The Genesis system is equipped with PUREViSION detectors, the AIDR-3D hybrid IR algorithm, and SUREExposure 3D dose modulation, providing 16-cm z-axis coverage suitable for whole-organ imaging. The Prism system, although offering the same 16-cm z-axis coverage, incorporates improved PUREViSION Optics with lower electronic noise and higher light-collection efficiency, enhanced dose-modulation features (including adaptive collimation and advanced exposure control), faster gantry rotation (up to 0.25 seconds), and optional single energy metal artifact reduction, in addition to the AiCE DLR algorithm. These hardware and software differences may have contributed to the observed reduction in radiation dose, in addition to the reconstruction algorithm itself.
Examinations had to meet predefined criteria, including being either a non-contrast chest CT, non-contrast head CT, chest–abdomen–pelvis (CAP) CT with injection, or CT angiography (CTA) of the lower limbs. Only examinations with a single series were included; thus, multiphasic CT was excluded.
A total of 13,060 patients were included in our study, with 4,361 patients in the CT-DLR group and 8,799 patients in the CT-IR group. Within the CT-DLR group, 1,567 patients were examined according to the chest CT protocol, 1,623 according to the head CT protocol, 1,109 according to the CAP oncology protocol, and 62 according to the lower limb CTA protocol. Within the CT-IR group, 3,060 patients were examined according to the chest CT protocol, 3,436 according to the head CT protocol, 2,058 according to the CAP oncology protocol, and 245 according to the lower limb CTA protocol (Figure 1).
To improve transparency and assess potential baseline imbalances between groups, we compiled a summary table of demographic characteristics, including age, sex distribution, BMI, and weight, for each protocol and reconstruction method (CT-IR and CT-DLR). These variables are known to affect radiation dose and were therefore reported to support comparability across groups (Table 1).
To homogenize data, examinations were selected based on anatomical area, contrast medium injection, and z-axis coverage (Table 2). The acquisition protocols were kept identical on both devices, except for the reconstruction algorithm used and the automatic z-axis dose modulation, which was adapted to these algorithms based on vendor recommendations.
Database
For each patient, we collected the date of the examination; the machine on which the examination was performed; the examination protocol; the patient’s sex, weight, height, and BMI (if available); the dose–length product (DLP; mGy.cm); the CT dose index (mGy); the exposure time (ms); the acquisition field length (mm); the tube voltage (kV); and the tension (mA). The radiation protection team of Strasbourg University Hospital conducted the data extraction.
Image quality assessment
For the specific qualitative and quantitative image analysis, we randomly selected 25 patients from the CT-IR group for each of the four examination protocols, using a stratified random sampling method with probabilities proportional to height. The stratification variable was BMI, which was categorized into four classes (BMI <20, 20< BMI <25, 25< BMI <30, BMI >30) to approximate clinical thresholds for underweight, normal weight, overweight, and obesity, in line with World Health Organization guidelines.19 These categories were chosen because patient body habitus substantially affects image noise and radiation dose modulation in CT examinations. This approach ensured that the sampled BMI distribution closely mirrored that of the overall study population.
Next, we selected 25 patients from the CT-DLR group using a 1:1 individual matching method that combined exact matching for sex and the nearest neighbor method for BMI. A caliper of ± 0.5 kg/m² was applied during BMI matching to ensure comparable body habitus between CT-DLR and CT-IR groups.
A sample size of 25 patients per protocol and per group (CT-DLR and CT-IR) was deemed sufficient for the paired comparison of image quality metrics based on SNR, CNR, and Likert scores. This sample size represents a practical balance between statistical robustness and the feasibility of manual image evaluation.
Qualitative analysis
We conducted a subjective analysis of image quality using a Likert scale ranging from 1 to 5, following the European guidelines on image quality.20 The analysis was performed by a single radiologist (LG, with 4 years of experience in CT). Evaluation criteria included visibility of relevant anatomical structures, sharpness of tissue boundaries, perceived image noise, soft tissue contrast resolution, and presence of reconstruction-related artifacts. The Likert scale categories were as follows: 1) non-diagnostic, 2) poor image quality, 3) limited image quality but diagnostic, 4) good image quality, and 5) optimal image quality.
Quantitative analysis
To ensure reproducibility and minimize inter-subject variability, all quantitative measurements (SNR and CNR) were standardized by using anatomically predefined, fixed-size regions of interest (ROIs). Background noise was consistently defined as the standard deviation of Hounsfield units within a homogeneous air region around the outside of the patient’s body. This approach, adapted from prior validated methods, was based on Bhosale21 and Chatzaraki et al.22 for CAP oncology and lower limb CTA, Tang et al.23 for chest CT, and Guziński et al.24 for head CT.
Chest–abdomen–pelvis oncology and lower limb computed tomography angiography
Based on Bhosale,21 we placed an axial slice ROI of approximately 20 mm² in the right liver parenchyma behind the vena cava (Figure 2), with another ROI of approximately 20 mm² in the abdominal subcutaneous fat when possible (the size of the ROI was reduced if the thickness of the adipose tissue was insufficient) to measure the mean value. In cases where the adipose tissue thickness was insufficient, the size of the ROI was reduced accordingly. Although smaller ROIs are known to be more susceptible to local noise, potentially influencing the reproducibility of the SNR and CNR measurements—particularly in low-contrast regions—ROIs were carefully placed in homogeneous areas to mitigate this effect. This approach is consistent with the methodological considerations discussed in prior studies, including Chatzaraki et al.22 A similarly sized ROI was placed in the background around the outside of the patient to measure the distribution (standard deviation) of voxel values.
According to the literature review by Chatzaraki et al.,22 the most reliable and reproducible method for calculating the SNR and CNR in images is to use the distribution of voxel values in the background as the denominator.
The SNR is calculated as follows:
The CNR is calculated as follows:
Chest computed tomography
We measured the mean voxel value by placing an ROI in the descending thoracic aorta, covering approximately half of the aortic lumen area in the axial section (Figure 2), similar to the study by Tang et al.23 We also measured the mean voxel value in the adipose tissue of the left axillary region, using an ROI of at least 50 mm², and we placed an ROI of at least 50 mm² in the background around the outside of the patient to measure the voxel value distribution (standard deviation).
The SNR is calculated as follows:
The CNR is calculated as follows:
Head computed tomography
We followed a method similar to that of Guziński et al.24 We measured the mean voxel value in an ROI of at least 20 mm² located in the anterior portion of the right caudate nucleus in the axial slice (Figure 2). We also measured the mean voxel value in a similarly sized ROI of the fourth ventricle and an ROI with a minimum area of 10 mm² in the background around the outside of the patient to measure the distribution (standard deviation) of the voxel value.
The SNR is calculated as follows:
The CNR is calculated as follows:
The position of these ROIs was optimized to minimize the variation in voxel values within them.
Statistical analysis
General population
Continuous variables are reported as mean ± standard deviation, whereas categorical variables are presented as counts and proportions. To analyze the radiation exposure, linear regression models were used. Extreme values of radiation exposure (> 95th percentile) were excluded from the models, as they may be caused by technical issues or multiple acquisitions during the same examination (such as patient movements and injection problems); their exclusion was intended to reduce the undue influence of outliers and improve the robustness of the analysis by focusing on typical cases.
To evaluate the independent effects of different factors, a multivariate model, including scanner type, protocol type, BMI, and sex, was constructed. Pairwise interactions between scanner type and other factors of interest were also included in the model to determine whether the effect of scanner type was dependent on other factors. The normality of the residuals was assessed using a quantile–quantile plot. The results are presented as differences with 95% confidence intervals (CIs).
Random samples
The analyses of image quality, SNR, and CNR were conducted on randomly selected samples with individual matching. Therefore, comparisons of the different variables of interest between the two types of scanners were performed using paired tests. Continuous variables were compared using the Wilcoxon signed-rank test, and categorical variables were compared using the Stuart–Maxwell test. A P value < 0.05 was considered statistically significant. The analyses were performed using R software version 4.1.1. (R Core Team, Vienna, Austria).
Results
Population
A total of 13,060 patients were included in the study—4,361 in the CT-DLR group and 8,799 in the CT-IR group—covering a range of clinical protocols: chest CT, head CT, CAP oncology, and lower limb CTA. Baseline demographic characteristics for each protocol and reconstruction group are summarized in Table 1.
Overall results
Compared with the CT-IR group, overall irradiation was significantly lower in the CT-DLR group, with a difference of 86.6 DLP units (95% CI: 78.5, 94.6; P < 0.001), corresponding to a radiation dose reduction of approximately 20%.
Further analysis by protocol showed an average reduction of 19.3% (95% CI: -21.6, -17) for head CT, 21.8% (95% CI: -24.1, -18.3) for chest CT, 20.9% (95% CI: -26.8, -20.2) for CAP oncology, and 19.5% (95% CI: -30.4, -7.0) for lower limb CTA in the CT-DLR group (Figure 3).
Multivariate analysis adjusting for BMI showed that male patients had a mean excess radiation of 47.9 DLP units (P < 0.001), corresponding to a 12% difference in delivered radiation dose. However, there was no significant difference between the CT-DLR and CT-IR groups [47.8 (95% CI: 41.6, 54.1) vs. 47.9 (95% CI: 43.2, 52.7), P = 0.965], indicating that the dose reduction achieved with DLR was consistent across sexes.
In terms of BMI, there was no significant difference in irradiation between the two groups for the head CT and lower limb CTA protocols. However, for the chest CT protocol, patients with BMI >30 showed a significant increase in dose reduction with DLR compared with those with BMI <20 [-62.1 (95% CI: -73.8, -49.1) vs. -30.7 (95% CI: -46.6, -14.9), P = 0.003]. Similarly, for the CAP oncology protocol, patients with BMI between 25 and 30 and those with BMI >30 showed a significant increase in dose reduction in the CT-DLR group compared with the CT-IR group [-115.8 (95% CI: -129.5, -102.1) and -131.0 (95% CI: -150.7, -111.4) vs. -57.6 (95% CI: -80.4, -34.9), P < 0.001 for both comparisons] (Figure 4).
Results samples
Table 3 displays the mean values for the SNR, CNR, and Likert scores used to assess image quality. Regardless of the acquisition protocol, the CT-DLR group exhibited significantly higher SNR, CNR, and Likert score values than the CT-IR group (P < 0.001).
Figure 5 displays the distribution of CNR values using a box-and-whisker plot.
Discussion
In a real-life retrospective analysis of over 10,000 CT examinations, we demonstrated that the use of DLR leads to a meaningful reduction in radiation dose compared with IR, with an average decrease of approximately 20%. This reduction was consistent and statistically significant across all four protocols, being most notable in the chest CT protocol and least notable in the head CT protocol. The observed differences remained significant even after adjusting for patient characteristics such as BMI and sex. Notably, dose reduction was greater in patients with higher BMI, especially in the CAP oncology and chest CT protocols. Image quality was qualitatively and quantitatively evaluated as being at least equivalent to that of IR, confirming the results already published in the literature,16, 17 while involving a larger number of patients.
The benefit of the DLR algorithm in terms of dose reduction compared with the IR algorithm appears to be slightly greater for the chest CT protocol than for the other protocols. This may be attributed to the lower attenuation of thoracic structures and the lung parenchyma having greater intrinsic contrast, making it more adapted to dose reduction.
Despite the dose reduction, we observed a superior objective (SNR, CNR) and subjective (Likert scale) image quality for each protocol, consistent with studies published in the literature.11-13 The DLR algorithm seems to have a greater advantage in terms of image quality for the chest CT and CAP oncology protocols than for the head CT and lower limb CTA protocols. This could be explained by the predominance of soft tissue on the thorax and abdomen, which the DLR algorithm is more effective at handling, compared with the lower limbs and skull, which have a larger proportion of bony structures. For lower limb CTA, however, the limited number of DLR cases resulted in wide CIs, and these findings should therefore be regarded as exploratory.
The clinical implications of this dose reduction are considerable. A recent large-scale risk-projection study by Smith-Bindman et al.25 estimated that approximately 103,000 future cancers may result from the 93 million CT examinations performed in the United States in 2023 alone. Notably, the largest share of projected cancers was attributed to CAP and chest CT scans—protocols in which we observed the highest dose reductions using DLR.25 Hence, a consistent 20% reduction in radiation dose across these high-utilization protocols could potentially contribute to reducing radiation-induced cancer risk at the population level. These findings highlight the dual benefit of DLR: achieving superior image quality while also supporting long-term cancer risk mitigation. A recent study by Kobayashi et al.18 demonstrated that the implementation of DLR could reduce the radiation dose by up to 45% and, importantly, lower the projected risk of radiation-induced cancer compared with IR. Taken together with our finding of a consistent 20% dose reduction in a large real-life cohort, these results underscore the broader clinical importance of adopting DLR, highlighting its potential not only for routine dose optimization but also for contributing to long-term patient safety.
Additionally, we found that male patients received, on average, 48 DLP units (approximately 12%) more radiation than female patients, independent of the type of examination, scanner, or BMI. This finding indicates a potential sex-related difference in radiation dose. However, the current literature on this topic is limited, and further dedicated investigations are required to better understand the underlying factors and confirm this observation.
We were able to demonstrate that the difference in irradiation between the two scanners is significantly greater for higher BMIs in the CAP oncology and lower limb CTA protocols. This difference may be attributed to the DLR algorithm’s superior ability to eliminate artifacts associated with anatomical factors related to body structure, thereby maintaining satisfactory image quality with more effective dose modulation.
Another important aspect to consider is that the CT-IR and CT-DLR groups were scanned on different generations of scanners (Aquilion One Genesis vs. Aquilion One Prism). Although acquisition protocols were identical, the fourth-generation Prism integrates incremental hardware advances such as improved detector efficiency, lower electronic noise, and more sophisticated dose modulation. Consequently, the 20% reduction in radiation dose observed in our study should not be attributed solely to the DLR algorithm but rather to the combined contribution of both reconstruction and hardware improvements. This potential confounding effect is important when interpreting our results and may partly explain the variability in dose reduction observed across protocols and patient subgroups.
Our findings of an average 20% reduction in radiation dose with DLR differ from the 45% reduction reported by Kobayashi et al.,18 who additionally performed lifetime attributable risk modeling. Several factors may explain this discrepancy. First, scanner hardware generation differed: our analysis compared the Aquilion One Genesis (IR) with the newer Aquilion One Prism (DLR), whereas Kobayashi et al.18 employed the Aquilion Precision. Such differences in detector design, dose modulation, and overall system architecture may partly confound the isolated contribution of the reconstruction algorithm. Second, protocol composition differed: our study included a broader range of routine protocols (head CT, chest CT, CAP oncology, and lower limb CTA), whereas Kobayashi et al.18 focused solely on body CT examinations (e.g., abdomen and pelvis), potentially delivering a different dose distribution. Finally, methodological differences must be acknowledged, particularly their use of risk-projection modeling vs. our emphasis on real-world clinical applicability. Taken together, these differences suggest that although absolute dose-reduction magnitudes may vary depending on study design, scanner generation, and protocol composition, the available evidence—including our large real-world cohort—supports the potential of DLR to meaningfully reduce patient radiation exposure.
Our study has several limitations that should be acknowledged. First, due to its retrospective design, it was not possible to strictly match patients between the CT-IR and CT-DLR groups. Additionally, the number of patients scanned with IR was higher, as the IR-equipped scanner was installed earlier and thus had a longer period of data collection. This temporal difference may have introduced slight variations in patient characteristics, possibly influenced by individual radiologists’ preferences for one scanner over the other. Nonetheless, no statistically significant differences were observed between the two populations. As a further limitation, being a single-center non-randomized study, some degree of temporal or operator-related bias cannot be entirely excluded, particularly given the longer accrual period of the IR scanner.
Being a single-center study, the generalizability of our results to other institutions or populations may be limited despite the large sample size. Some inaccuracies in protocol labeling may also have occurred, such as injected head CTs being mislabeled or CAP oncology scans being performed without contrast. These errors did not affect the radiation dose measurements and only minimally influenced the SNR and CNR results in subgroup analyses.
A few extreme values were excluded during multivariate modeling to ensure robust analysis; however, the exclusion rate was balanced across both groups. In addition, qualitative and quantitative image assessments were performed by a single radiologist with 4 years of CT experience. Although full blinding was not feasible due to the visually distinguishable differences between IR and DLR images, the consistency between subjective Likert scores and objective metrics (SNR, CNR), together with concordance with the existing literature, supports the validity of our findings. Furthermore, the small sample size in the lower limb CTA subgroup limits the precision of dose estimates, and these results should be interpreted with caution.
Finally, because IR and DLR data were acquired on scanners from different generations, hardware advances in the Prism system may have contributed to the observed dose reduction, representing a potential confounding factor.
In conclusion, in this retrospective study, we found that implementing DLR algorithms resulted in a mean 20% reduction in radiation dose compared with IR algorithms, across a wide range of patients and scanning protocols, with—at least—non-inferior qualitative and quantitative image quality. Future studies could explore the difference in radiation dose between men and women, independent of other factors, and determine the optimal balance between dose reduction and image quality enhancement offered by these DLR algorithms.


