Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

External validation of the deep learning system "SpineNet" for grading radiological features of degeneration on MRIs of the lumbar spine

Grob, Alexandra; Loibl, Markus; Jamaludin, Amir; Winklhofer, Sebastian; Fairbank, Jeremy C T; Fekete, Tamás; Porchet, François; Mannion, Anne F (2022). External validation of the deep learning system "SpineNet" for grading radiological features of degeneration on MRIs of the lumbar spine. European Spine Journal, 31(8):2137-2148.

Abstract

BACKGROUND: Magnetic resonance imaging (MRI) is used to detect degenerative changes of the lumbar spine. SpineNet (SN), a computer vision-based system, performs an automated analysis of degenerative features in MRI scans aiming to provide high accuracy, consistency and objectivity. This study evaluated SN's ratings compared with those of an expert radiologist.
METHOD: MRIs of 882 patients (mean age, 72 ± 8.8 years) with degenerative spinal disorders from two previous trials carried out in our spine center between 2011 and 2019, were analyzed by an expert radiologist. Lumbar segments (L1/2-L5/S1) were graded for Pfirrmann Grades (PG), Spondylolisthesis (SL) and Central Canal Stenosis (CCS). SN's analysis for the equivalent parameters was generated. Agreement between methods was analyzed using kappa (κ), Spearman correlation (ρ) and Lin's concordance correlation (ρ$_{c}$) coefficients and class average accuracy (CAA).
RESULTS: 4410 lumbar segments were analyzed. κ statistics showed moderate to substantial agreement in PG between the radiologist and SN depending on spinal level (range κ 0.63-0.77, all levels together 0.72; range CAA 45-68%, all levels 55%), slight to substantial agreement for SL (range κ 0.07-0.60, all levels 0.63; range CAA 47-57%, all levels 56%) and CCS (range κ 0.17-0.57, all levels 0.60; range CAA 35-41%, all levels 43%). SN tended to record more pathological features in PG than did the radiologist whereas the contrary was the case for CCS. SL showed an even distribution between methods.
CONCLUSION: SN is a robust and reliable tool with the ability to grade degenerative features such as PG, SL or CCS in lumbar MRIs with moderate to substantial agreement compared to the current gold-standard, the radiologist. It is a valuable alternative for analyzing MRIs from large cohorts for diagnostic and research purposes.

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:04 Faculty of Medicine > University Hospital Zurich > Clinic for Neuroradiology
04 Faculty of Medicine > University Hospital Zurich > Clinic for Neurosurgery
Dewey Decimal Classification:610 Medicine & health
Scopus Subject Areas:Health Sciences > Surgery
Health Sciences > Orthopedics and Sports Medicine
Uncontrolled Keywords:Degenerative spinal disorders, Diagnostic imaging, Inter-rater agreement, Automated software, Machine learning, Disc degeneration
Language:English
Date:August 2022
Deposited On:03 Aug 2022 14:50
Last Modified:24 Feb 2025 02:42
Publisher:Springer
ISSN:0940-6719
OA Status:Closed
Publisher DOI:https://doi.org/10.1007/s00586-022-07311-x
PubMed ID:35835892

Metadata Export

Statistics

Citations

Dimensions.ai Metrics
18 citations in Web of Science®
14 citations in Scopus®
Google Scholar™

Altmetrics

Downloads

0 downloads since deposited on 03 Aug 2022
0 downloads since 12 months

Authors, Affiliations, Collaborations

Similar Publications