Page 114 of 152 (1519 results)

AI model using CT-based imaging biomarkers to predict hepatocellular carcinoma in patients with chronic hepatitis B.

Shin H, Hur MH, Song BG, Park SY, Kim GA, Choi G, Nam JY, Kim MA, Park Y, Ko Y, Park J, Lee HA, Chung SW, Choi NR, Park MK, Lee YB, Sinn DH, Kim SU, Kim HY, Kim JM, Park SJ, Lee HC, Lee DH, Chung JW, Kim YJ, Yoon JH, Lee JH

PubMed · papers · Jun 1 2025
Various hepatocellular carcinoma (HCC) prediction models have been proposed for patients with chronic hepatitis B (CHB) using clinical variables. We aimed to develop an artificial intelligence (AI)-based HCC prediction model by incorporating imaging biomarkers derived from abdominal computed tomography (CT) images along with clinical variables. An AI prediction model employing a gradient-boosting machine algorithm was developed utilizing imaging biomarkers extracted by DeepFore, a deep learning-based CT auto-segmentation software. The derivation cohort (n = 5,585) was randomly divided into the training and internal validation sets at a 3:1 ratio. The external validation cohort included 2,883 patients. Six imaging biomarkers (i.e. abdominal visceral fat-total fat volume ratio, total fat-trunk volume ratio, spleen volume, liver volume, liver-spleen Hounsfield unit ratio, and muscle Hounsfield unit) and eight clinical variables were selected as the main variables of our model, PLAN-B-DF. In the internal validation set (median follow-up duration = 7.4 years), PLAN-B-DF demonstrated an excellent predictive performance with a c-index of 0.91 and good calibration function (p = 0.78 by the Hosmer-Lemeshow test). In the external validation cohort (median follow-up duration = 4.6 years), PLAN-B-DF showed a significantly better discrimination function compared to previous models, including PLAN-B, PAGE-B, modified PAGE-B, and CU-HCC (c-index, 0.89 vs. 0.65-0.78; all p <0.001), and maintained a good calibration function (p = 0.42 by the Hosmer-Lemeshow test). When patients were classified into four groups according to the risk probability calculated by PLAN-B-DF, the 10-year cumulative HCC incidence was 0.0%, 0.4%, 16.0%, and 46.2% in the minimal-, low-, intermediate-, and high-risk groups, respectively. 
This AI prediction model, integrating deep learning-based auto-segmentation of CT images, offers improved performance in predicting HCC risk among patients with CHB compared to previous models. The novel predictive model PLAN-B-DF, employing an automated computed tomography segmentation algorithm, significantly improves predictive accuracy and risk stratification for hepatocellular carcinoma in patients with chronic hepatitis B (CHB). Using a gradient-boosting algorithm and computed tomography metrics, such as visceral fat volume and myosteatosis, PLAN-B-DF outperforms previous models based solely on clinical and demographic data. This model not only shows a higher c-index compared to previous models, but also effectively classifies patients with CHB into different risk groups. This model uses machine learning to analyze the complex relationships among various risk factors contributing to hepatocellular carcinoma occurrence, thereby enabling more personalized surveillance for patients with CHB.
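The c-index reported above (0.91 internal, 0.89 external) summarizes how often a model assigns the higher risk to the patient who develops HCC sooner. A minimal sketch of Harrell's concordance index is shown below. The function name and the toy inputs are illustrative assumptions, not the authors' code, and the abstract does not state which c-index variant was used.

```python
def harrell_c_index(times, events, risks):
    """Harrell's concordance index for right-censored follow-up data.

    times  : follow-up time for each patient
    events : 1 if the event (e.g. HCC) was observed, 0 if censored
    risks  : predicted risk score, higher means higher predicted risk
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a pair is only usable if the earlier patient had an event
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0   # earlier event got the higher risk
                elif risks[i] == risks[j]:
                    concordant += 0.5   # tied predictions count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


# Toy data (hypothetical): four patients, the third one censored.
toy_times = [2, 4, 6, 8]
toy_events = [1, 1, 0, 1]
toy_risks = [0.9, 0.6, 0.7, 0.1]
toy_c = harrell_c_index(toy_times, toy_events, toy_risks)
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is the scale on which the comparison with PLAN-B, PAGE-B, modified PAGE-B, and CU-HCC (0.65-0.78) is made.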

Incorporating Radiologist Knowledge Into MRI Quality Metrics for Machine Learning Using Rank-Based Ratings.

Tang C, Eisenmenger LB, Rivera-Rivera L, Huo E, Junn JC, Kuner AD, Oechtering TH, Peret A, Starekova J, Johnson KM

PubMed · papers · Jun 1 2025
Deep learning (DL) often requires an image quality metric; however, widely used metrics are not designed for medical images. To develop an image quality metric that is specific to MRI using radiologists' image rankings and DL models. Retrospective. A total of 19,344 rankings on 2916 unique image pairs from the NYU fastMRI Initiative neuro database were used to train the neural network-based image quality metrics, with an 80%/20% training/validation split and fivefold cross-validation. 1.5 T and 3 T T1, T1 postcontrast, T2, and fluid-attenuated inversion recovery (FLAIR). Synthetically corrupted image pairs were ranked by radiologists (N = 7), with a subset also scoring images using a Likert scale (N = 2). DL models were trained to match rankings using two architectures (EfficientNet and IQ-Net) with and without reference image subtraction and compared to ranking based on mean squared error (MSE) and structural similarity (SSIM). Image quality assessing DL models were evaluated as alternatives to MSE and SSIM as optimization targets for DL denoising and reconstruction. Radiologists' agreement was assessed by a percentage metric and quadratic weighted Cohen's kappa. Ranking accuracies were compared using repeated measurements analysis of variance. Reconstruction models trained with IQ-Net score, MSE and SSIM were compared by paired t test. P < 0.05 was considered significant. Compared to direct Likert scoring, ranking produced a higher level of agreement between radiologists (70.4% vs. 25%). Image ranking was subjective with a high level of intraobserver agreement (94.9% ± 2.4%) and lower interobserver agreement (61.47% ± 5.51%).
IQ-Net and EfficientNet accurately predicted rankings with a reference image (75.2% ± 1.3% and 79.2% ± 1.7%). However, EfficientNet resulted in images with artifacts and high MSE when used in denoising tasks, while IQ-Net-optimized networks performed well for both denoising and reconstruction tasks. Image quality networks can be trained from image ranking and used to optimize DL tasks. Evidence level: 3. Technical efficacy: Stage 1.
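The abstract assesses radiologist agreement with a percentage metric and quadratic weighted Cohen's kappa. The sketch below shows one common way to compute both for two raters scoring on an ordinal scale. The function names and example ratings are hypothetical, and the study's exact agreement definition is not given in the abstract.

```python
def percent_agreement(rater_a, rater_b):
    """Share of items on which two raters gave the identical rating."""
    if not rater_a or len(rater_a) != len(rater_b):
        raise ValueError("ratings must be non-empty and of equal length")
    same = sum(1 for a, b in zip(rater_a, rater_b) if a == b)
    return same / len(rater_a)


def quadratic_weighted_kappa(rater_a, rater_b, n_categories):
    """Cohen's kappa with quadratic disagreement weights.

    Ratings are integers in 0..n_categories-1 (e.g. a 5-point Likert
    scale mapped to 0..4). Large disagreements are penalized more.
    """
    n = len(rater_a)
    if n == 0 or n != len(rater_b):
        raise ValueError("ratings must be non-empty and of equal length")
    k = n_categories
    observed = [[0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[a][b] += 1
    row_totals = [sum(observed[i]) for i in range(k)]
    col_totals = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    weighted_observed = 0.0
    weighted_expected = 0.0
    for i in range(k):
        for j in range(k):
            weight = (i - j) ** 2 / (k - 1) ** 2
            weighted_observed += weight * observed[i][j]
            # Expected count if the two raters were independent.
            weighted_expected += weight * row_totals[i] * col_totals[j] / n
    if weighted_expected == 0:
        raise ValueError("kappa is undefined when only one category is used")
    return 1.0 - weighted_observed / weighted_expected


# Hypothetical Likert scores (0..4) from two readers on six images.
reader_1 = [0, 1, 2, 3, 4, 2]
reader_2 = [0, 1, 2, 3, 4, 3]
agreement = percent_agreement(reader_1, reader_2)
kappa = quadratic_weighted_kappa(reader_1, reader_2, n_categories=5)
```

Kappa corrects for agreement expected by chance, so it can be much lower than raw percent agreement when raters concentrate on a few categories.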

Dual Energy CT for Deep Learning-Based Segmentation and Volumetric Estimation of Early Ischemic Infarcts.

Kamel P, Khalid M, Steger R, Kanhere A, Kulkarni P, Parekh V, Yi PH, Gandhi D, Bodanapally U

PubMed · papers · Jun 1 2025
Ischemic changes are not visible on non-contrast head CT until several hours after infarction, though deep convolutional neural networks have shown promise in the detection of subtle imaging findings. This study aims to assess if dual-energy CT (DECT) acquisition can improve early infarct visibility for machine learning. The retrospective dataset consisted of 330 DECTs acquired up to 48 h prior to confirmation of a DWI positive infarct on MRI between 2016 and 2022. Infarct segmentation maps were generated from the MRI and co-registered to the CT to serve as ground truth for segmentation. A self-configuring 3D nnU-Net was trained for segmentation on (1) standard 120 kV mixed-images (2) 190 keV virtual monochromatic images and (3) 120 kV + 190 keV images as dual channel inputs. Algorithm performance was assessed with Dice scores with paired t-tests on a test set. Global aggregate Dice scores were 0.616, 0.645, and 0.665 for standard 120 kV images, 190 keV, and combined channel inputs respectively. Differences in overall Dice scores were statistically significant with highest performance for combined channel inputs (p < 0.01). Small but statistically significant differences were observed for infarcts between 6 and 12 h from last-known-well with higher performance for larger infarcts. Volumetric accuracy trended higher with combined inputs but differences were not statistically significant (p = 0.07). Supplementation of standard head CT images with dual-energy data provides earlier and more accurate segmentation of infarcts for machine learning particularly between 6 and 12 h after last-known-well.
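For readers unfamiliar with the metric, the Dice score compares a predicted mask with the ground-truth mask as twice their overlap divided by their combined size. The sketch below computes a per-case Dice and a pooled version. Reading "global aggregate Dice" as pooling voxels across all test cases is an assumption here, and the function names and flattened toy masks are illustrative only.

```python
def dice_score(pred_mask, truth_mask):
    """Dice coefficient for two binary masks given as flat 0/1 sequences."""
    overlap = sum(1 for p, t in zip(pred_mask, truth_mask) if p and t)
    size_sum = sum(1 for p in pred_mask if p) + sum(1 for t in truth_mask if t)
    if size_sum == 0:
        return 1.0  # both masks empty: treated as perfect agreement
    return 2.0 * overlap / size_sum


def global_aggregate_dice(cases):
    """Dice over voxels pooled across all cases.

    cases : iterable of (pred_mask, truth_mask) pairs.
    Assumption: "global aggregate" means summing overlaps and mask
    sizes over the whole test set before taking the ratio.
    """
    overlap = 0
    size_sum = 0
    for pred_mask, truth_mask in cases:
        overlap += sum(1 for p, t in zip(pred_mask, truth_mask) if p and t)
        size_sum += sum(1 for p in pred_mask if p)
        size_sum += sum(1 for t in truth_mask if t)
    if size_sum == 0:
        return 1.0
    return 2.0 * overlap / size_sum


# Toy flattened masks (hypothetical).
case_a = ([1, 1, 0, 0], [1, 0, 1, 0])
case_b = ([1, 1], [1, 1])
dice_a = dice_score(*case_a)
pooled = global_aggregate_dice([case_a, case_b])
```

Pooling weights large infarcts more heavily than averaging per-case scores, which is worth keeping in mind when comparing the 0.616, 0.645, and 0.665 figures with per-case results from other studies.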

Radiomics-driven spectral profiling of six kidney stone types with monoenergetic CT reconstructions in photon-counting CT.

Hertel A, Froelich MF, Overhoff D, Nestler T, Faby S, Jürgens M, Schmidt B, Vellala A, Hesse A, Nörenberg D, Stoll R, Schmelz H, Schoenberg SO, Waldeck S

PubMed · papers · Jun 1 2025
Urolithiasis, a common and painful urological condition, is influenced by factors such as lifestyle, genetics, and medication. Differentiating between different types of kidney stones is crucial for personalized therapy. The purpose of this study is to investigate the use of photon-counting computed tomography (PCCT) in combination with radiomics and machine learning to develop a method for automated and detailed characterization of kidney stones. This approach aims to enhance the accuracy and detail of stone classification beyond what is achievable with conventional computed tomography (CT) and dual-energy CT (DECT). In this ex vivo study, 135 kidney stones were first classified using infrared spectroscopy. All stones were then scanned in a PCCT embedded in a phantom. Various monoenergetic reconstructions were generated, and radiomics features were extracted. Statistical analysis was performed using Random Forest (RF) classifiers for both individual reconstructions and a combined model. The combined model, using radiomics features from all monoenergetic reconstructions, significantly outperformed individual reconstructions and SPP parameters, with an AUC of 0.95 and test accuracy of 0.81 for differentiating all six stone types. Feature importance analysis identified key parameters, including NGTDM_Strength and wavelet-LLH_firstorder_Variance. This ex vivo study demonstrates that radiomics-driven PCCT analysis can improve differentiation between kidney stone subtypes. The combined model outperformed individual monoenergetic levels, highlighting the potential of spectral profiling in PCCT to optimize treatment through image-based strategies. Question: How can photon-counting computed tomography (PCCT) combined with radiomics improve the differentiation of kidney stone types beyond conventional CT and dual-energy CT, enhancing personalized therapy?
Findings: Our ex vivo study demonstrates that a combined spectral-driven radiomics model achieved 95% AUC and 81% test accuracy in differentiating six kidney stone types. Clinical relevance: Implementing PCCT-based spectral-driven radiomics allows for precise non-invasive differentiation of kidney stone types, leading to improved diagnostic accuracy and more personalized, effective treatment strategies, potentially reducing the need for invasive procedures and recurrence.

Age-dependent changes in CT vertebral attenuation values in opportunistic screening for osteoporosis: a nationwide multi-center study.

Kim Y, Kim HY, Lee S, Hong S, Lee JW

PubMed · papers · Jun 1 2025
To examine how vertebral attenuation changes with aging, and to establish age-adjusted CT attenuation value cutoffs for diagnosing osteoporosis. This multi-center retrospective study included 11,246 patients (mean age ± standard deviation, 50 ± 13 years; 7139 men) who underwent CT and dual-energy X-ray absorptiometry (DXA) in six health-screening centers between 2022 and 2023. Using deep-learning-based software, attenuation values of L1 vertebral bodies were measured. Segmented linear regression in women and simple linear regression in men were used to assess how attenuation values change with aging. A multivariable linear regression analysis was performed to determine whether age is associated with CT attenuation values independently of the DXA T-score. Age-adjusted cutoffs targeting either 90% sensitivity or 90% specificity were derived using quantile regression. Performance of both age-adjusted and age-unadjusted cutoffs was measured, where the target sensitivity or specificity was considered achieved if a 95% confidence interval encompassed 90%. While attenuation values declined consistently with age in men, they declined abruptly in women aged > 42 years. Such decline occurred independently of the DXA T-score (p < 0.001). Age adjustment seemed critical for age ≥ 65 years, where the age-adjusted cutoffs achieved the target (sensitivity of 91.5% (86.3-95.2%) when targeting 90% sensitivity and specificity of 90.0% (88.3-91.6%) when targeting 90% specificity), but age-unadjusted cutoffs did not (95.5% (91.2-98.0%) and 73.8% (71.4-76.1%), respectively). Age-adjusted cutoffs provided a more reliable diagnosis of osteoporosis than age-unadjusted cutoffs since vertebral attenuation values decrease with age, regardless of DXA T-scores. Question: How does vertebral CT attenuation change with age?
Findings: Independent of dual-energy X-ray absorptiometry T-score, vertebral attenuation values on CT declined at a constant rate in men and abruptly in women over 42 years of age. Clinical relevance: Age adjustments are needed in opportunistic osteoporosis screening, especially among the elderly.

The role of deep learning in diagnostic imaging of spondyloarthropathies: a systematic review.

Omar M, Watad A, McGonagle D, Soffer S, Glicksberg BS, Nadkarni GN, Klang E

PubMed · papers · Jun 1 2025
Diagnostic imaging is an integral part of identifying spondyloarthropathies (SpA), yet the interpretation of these images can be challenging. This review evaluated the use of deep learning models to enhance the diagnostic accuracy of SpA imaging. Following PRISMA guidelines, we systematically searched major databases up to February 2024, focusing on studies that applied deep learning to SpA imaging. Performance metrics, model types, and diagnostic tasks were extracted and analyzed. Study quality was assessed using QUADAS-2. We analyzed 21 studies employing deep learning in SpA imaging diagnosis across MRI, CT, and X-ray modalities. These models, particularly advanced CNNs and U-Nets, demonstrated high accuracy in diagnosing SpA, differentiating arthritis forms, and assessing disease progression. Performance metrics frequently surpassed traditional methods, with some models achieving AUCs up to 0.98 and matching expert radiologist performance. This systematic review underscores the effectiveness of deep learning in SpA imaging diagnostics across MRI, CT, and X-ray modalities. The studies reviewed demonstrated high diagnostic accuracy. However, the presence of small sample sizes in some studies highlights the need for more extensive datasets and further prospective and external validation to enhance the generalizability of these AI models. Question: How can deep learning models improve diagnostic accuracy in imaging for spondyloarthropathies (SpA), addressing challenges in early detection and differentiation from other forms of arthritis? Findings: Deep learning models, especially CNNs and U-Nets, showed high accuracy in SpA imaging across MRI, CT, and X-ray, often matching or surpassing expert radiologists. Clinical relevance: Deep learning models can enhance diagnostic precision in SpA imaging, potentially reducing diagnostic delays and improving treatment decisions, but further validation on larger datasets is required for clinical integration.

Comparing fully automated AI body composition biomarkers at differing virtual monoenergetic levels using dual-energy CT.

Toia GV, Garret JW, Rose SD, Szczykutowicz TP, Pickhardt PJ

PubMed · papers · Jun 1 2025
To investigate the behavior of artificial intelligence (AI) CT-based body composition biomarkers at different virtual monoenergetic imaging (VMI) levels using dual-energy CT (DECT). This retrospective study included 88 contrast-enhanced abdominopelvic CTs acquired with rapid-kVp switching DECT. Images were reconstructed into five VMI levels (40, 55, 70, 85, 100 keV). Fully automated algorithms for quantifying CT number (HU) in abdominal fat (subcutaneous and visceral), skeletal muscle, bone, calcium (abdominal Agatston score), and organ size (area or volume) were applied. Biomarker median difference relative to 70 keV and interquartile range were reported by energy level to characterize variation. Linear regression was performed to calibrate non-70 keV data and to estimate their equivalent 70 keV biomarker attenuation values. Relative to 70 keV, absolute median differences in attenuation-based biomarkers (excluding Agatston score) ranged 39-358, 12-102, 5-48, 9-75 HU for 40, 55, 85, 100 keV, respectively. For area-based biomarkers, differences ranged 6-15, 3-4, 2-7, 0-5 cm² for 40, 55, 85, 100 keV. For volume-based biomarkers, differences ranged 12-34, 8-68, 12-52, 1-57 cm³ for 40, 55, 85, 100 keV. Agatston score behavior was more spurious with median differences ranging 70-204 HU. In general, VMI < 70 keV showed more variation in median biomarker measurement than VMI > 70 keV. This study characterized the behavior of a fully automated AI CT biomarker toolkit across varying VMI levels obtained with DECT. The data showed relatively little biomarker value change when measured at or greater than 70 keV. Lower VMI datasets should be avoided due to larger deviations in measured value as compared to 70 keV, a level considered equivalent to conventional 120 kVp exams.
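The calibration step described above, fitting a linear regression so that a biomarker measured at another VMI level can be mapped to its 70 keV equivalent, can be sketched with ordinary least squares. The function names and the paired HU values below are made up for illustration and are not taken from the study.

```python
def fit_linear_calibration(measured, reference):
    """Least-squares fit of reference ≈ slope * measured + intercept.

    measured  : biomarker values at a non-70 keV level (e.g. 40 keV)
    reference : the same biomarker in the same patients at 70 keV
    """
    n = len(measured)
    if n < 2 or n != len(reference):
        raise ValueError("need at least two paired measurements")
    mean_x = sum(measured) / n
    mean_y = sum(reference) / n
    sxx = sum((x - mean_x) ** 2 for x in measured)
    if sxx == 0:
        raise ValueError("measured values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(measured, reference))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def to_70kev_equivalent(value, slope, intercept):
    """Apply a fitted calibration to a new non-70 keV measurement."""
    return slope * value + intercept


# Hypothetical paired muscle attenuation values in HU (not study data).
hu_40kev = [80.0, 100.0, 120.0]
hu_70kev = [45.0, 55.0, 65.0]
slope, intercept = fit_linear_calibration(hu_40kev, hu_70kev)
estimate = to_70kev_equivalent(110.0, slope, intercept)
```

A separate fit would be needed for each biomarker and each energy level, since the abstract reports very different deviation ranges across them.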

Automatic 3-dimensional analysis of posterosuperior full-thickness rotator cuff tear size on magnetic resonance imaging.

Hess H, Gussarow P, Rojas JT, Zumstein MA, Gerber K

PubMed · papers · Jun 1 2025
Tear size and shape are known to prognosticate the efficacy of surgical rotator cuff (RC) repair; however, current manual measurements on magnetic resonance images (MRIs) exhibit high interobserver variabilities and exclude 3-dimensional (3D) morphologic information. This study aimed to develop algorithms for automatic 3D analyses of posterosuperior full-thickness RC tear to enable efficient and precise tear evaluation and 3D tear visualization. A deep-learning network for automatic segmentation of the tear region in coronal and sagittal multicenter MRI was trained with manually segmented (consensus of 3 experts) proton density- and T2-weighted MRI of shoulders with full-thickness posterosuperior tears (n = 200). Algorithms for automatic measurement of tendon retraction, tear width, tear area, and automatic Patte classification considering the 3D morphology of the shoulder were implemented and evaluated against manual segmentation (n = 59). Automatic Patte classification was calculated using automatic segmented humerus and scapula on T1-weighted MRI of the same shoulders. Tears were automatically segmented, enabling 3D visualization of the tear, with a mean Dice coefficient of 0.58 ± 0.21 compared to an interobserver variability of 0.46 ± 0.21. The mean absolute error of automatic tendon retraction and tear width measurements (4.98 ± 4.49 mm and 3.88 ± 3.18 mm) were lower than the interobserver variabilities (5.42 ± 7.09 mm and 5.92 ± 1.02 mm). The correlations of all measurements performed on automatic tear segmentations compared with those on consensus segmentations were higher than the interobserver correlation. Automatic Patte classification achieved a Cohen kappa value of 0.62, compared with the interobserver variability of 0.56. Retraction calculated using standard linear measures underestimated the tear size relative to measurements considering the curved shape of the humeral head, especially for larger tears. 
Even on highly heterogeneous data, the proposed algorithms showed the feasibility to successfully automate tear size analysis and to enable automatic 3D visualization of the tear situation. The presented algorithms standardize cross-center tear analyses and enable the calculation of additional metrics, potentially improving the predictive power of image-based tear measurements for the outcome of surgical treatments, thus aiding in RC tear diagnosis, treatment decision, and planning.
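The finding that straight-line retraction measures underestimate tear size on a curved humeral head, more so for larger tears, follows from basic geometry. As a simplified illustration, assume the humeral head surface is a circular arc of radius r: a straight-line (chord) distance c between two surface points then corresponds to a surface distance of 2r·asin(c / 2r). The function name and the 24 mm radius are hypothetical values chosen for the example, not the authors' method.

```python
import math


def arc_length_from_chord(chord_mm, radius_mm):
    """Surface distance along a circle for a given straight-line distance.

    Simplifying assumption: the humeral head is modeled as a circle of
    radius radius_mm, and both endpoints lie on its surface.
    """
    if radius_mm <= 0 or not 0 <= chord_mm <= 2 * radius_mm:
        raise ValueError("chord must lie between 0 and the diameter")
    return 2.0 * radius_mm * math.asin(chord_mm / (2.0 * radius_mm))


# Illustrative radius (hypothetical) and two linear retraction values.
head_radius_mm = 24.0
small_gap = arc_length_from_chord(10.0, head_radius_mm) - 10.0
large_gap = arc_length_from_chord(40.0, head_radius_mm) - 40.0
```

Under this model the curved distance always exceeds the linear one, and the gap grows with tear size, which matches the direction of the effect reported in the abstract.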

MRI-based radiomic nomogram for predicting disease-free survival in patients with locally advanced rectal cancer.

Liu J, Liu K, Cao F, Hu P, Bi F, Liu S, Jian L, Zhou J, Nie S, Lu Q, Yu X, Wen L

PubMed · papers · Jun 1 2025
Individual prognosis assessment is of paramount importance for treatment decision-making and active surveillance in cancer patients. We aimed to propose a radiomic model based on pre- and post-therapy MRI features for predicting disease-free survival (DFS) in locally advanced rectal cancer (LARC) following neoadjuvant chemoradiotherapy (nCRT) and subsequent surgical resection. This retrospective study included a total of 126 LARC patients, who were randomly assigned to a training set (n = 84) and a validation set (n = 42). All patients underwent pre- and post-nCRT MRI scans. Radiomic features were extracted from higher resolution T2-weighted images. Pearson correlation analysis and ANOVA or Relief were utilized for identifying radiomic features associated with DFS. Pre-treatment, post-treatment, and delta radscores were constructed by machine learning algorithms. An individualized nomogram was developed based on significant radscores and clinical variables using multivariate Cox regression analysis. Predictive performance was evaluated by the C-index, calibration curve, and decision curve analysis. In the validation set, the clinical model including pre-surgery carcinoembryonic antigen (CEA), chemotherapy after radiotherapy, and pathological stage yielded a C-index of 0.755 (95% confidence interval [CI]: 0.739-0.771). The optimal pre-, post-, and delta-radscores achieved C-indices of 0.724 (95% CI: 0.701-0.747), 0.701 (95% CI: 0.671-0.731), and 0.625 (95% CI: 0.589-0.661), respectively. The nomogram integrating pre-surgery CEA and pathological stage with the pre- and post-nCRT radscores obtained the highest C-index of 0.833 (95% CI: 0.815-0.851). The calibration curve and decision curves exhibited good calibration and clinical usefulness of the nomogram. Furthermore, the nomogram categorized patients into high- and low-risk groups exhibiting distinct DFS (both P < 0.0001).
The nomogram incorporating pre- and post-therapy radscores and clinical factors could predict DFS in patients with LARC, which helps clinicians in optimizing decision-making and surveillance in real-world settings.