Background and Objective: High resolution computed tomography (HRCT) scan diagnostic classification for usual interstitial pneumonia (UIP) plays a critical role in therapeutic decision-making and clinical trial eligibility for interstitial lung disease (ILD) patients, but is subject to variability. A deep learning algorithm, the Systematic Objective Fibrotic Imaging Analysis Algorithm (SOFIA), has been validated to assist classification of HRCTs based on current guidelines. In this study, we evaluate the impact of SOFIA on inter-observer agreement for UIP classification and prognostic accuracy of clinicians' assessment of ILD HRCTs. Methods: Radiologists and pulmonologists (reviewers) were invited to evaluate 203 HRCTs from a national fibrotic ILD registry, scoring each of four UIP categories (definite UIP, probable UIP, indeterminate, or alternative diagnosis). SOFIA outputs were then provided, and reviewers were able to reevaluate their scores. Changes in interobserver agreement for UIP classification and prognostic accuracy were calculated. Results: Three hundred twelve reviewers (120 radiologists, 192 pulmonologists) from 49 countries evaluated 203 HRCT scans. Following SOFIA, inter-observer diagnostic agreement improved for definite UIP from moderate to good (ICCpre = 0.54[0.50–0.60]; ICCpost = 0.70[0.66–0.74]), and for probable UIP from fair to moderate (ICCpre = 0.30[0.27–0.35]; ICCpost = 0.53[0.49–0.58]). Following SOFIA, there was improved prognostic accuracy for reviewers' definite UIP, probable UIP, and indeterminate scores (significant change in c-index), and the proportion of reviewers whose probable UIP scores were significantly predictive of transplant-free survival increased by 42%. Conclusion: Providing SOFIA algorithm output to clinicians reviewing HRCT scans improved diagnostic agreement and prognostic accuracy for fibrotic ILD. SOFIA may be a useful automated assistive tool to support improved diagnostic consistency.

Deep-Learning Algorithm Diagnostic Support for Usual Interstitial Pneumonia Pattern Recognition in Fibrotic Interstitial Lung Disease / Fermoyle, C.C., Mackintosh, J.A., Navaratnam, V., Ellis, S.J., Cooper, W.A., Goh, N.S.L., Moodley, Y., Reynolds, P.N., Zappala, C.J., Hopkins, P., Glaspole, I.N., Corte, T.J., Walsh, S.L.F., Accornero, D., Agrawal, A., Akilli, I.K., Alamoudi, O., Alberti, M.L., Aliannejad, R., Aljahdali, H., et al.. - In: RESPIROLOGY. - ISSN 1323-7799. - (2026). [10.1002/resp.70246]

Deep-Learning Algorithm Diagnostic Support for Usual Interstitial Pneumonia Pattern Recognition in Fibrotic Interstitial Lung Disease

Balbi M.;Bocchino M.;Costantini P.;D'Andrea G.;Khan M. A.;Ledda R. E.;Marrocchio C.;Mukherjee S.;Novelli L.;Popa D.;Sebastiani A.;Toma C. L.;Zompatori M.
2026-01-01

Abstract

Background and Objective: High resolution computed tomography (HRCT) scan diagnostic classification for usual interstitial pneumonia (UIP) plays a critical role in therapeutic decision-making and clinical trial eligibility for interstitial lung disease (ILD) patients, but is subject to variability. A deep learning algorithm, the Systematic Objective Fibrotic Imaging Analysis Algorithm (SOFIA), has been validated to assist classification of HRCTs based on current guidelines. In this study, we evaluate the impact of SOFIA on inter-observer agreement for UIP classification and prognostic accuracy of clinicians' assessment of ILD HRCTs. Methods: Radiologists and pulmonologists (reviewers) were invited to evaluate 203 HRCTs from a national fibrotic ILD registry, scoring each of four UIP categories (definite UIP, probable UIP, indeterminate, or alternative diagnosis). SOFIA outputs were then provided, and reviewers were able to reevaluate their scores. Changes in interobserver agreement for UIP classification and prognostic accuracy were calculated. Results: Three hundred twelve reviewers (120 radiologists, 192 pulmonologists) from 49 countries evaluated 203 HRCT scans. Following SOFIA, inter-observer diagnostic agreement improved for definite UIP from moderate to good (ICCpre = 0.54[0.50–0.60]; ICCpost = 0.70[0.66–0.74]), and for probable UIP from fair to moderate (ICCpre = 0.30[0.27–0.35]; ICCpost = 0.53[0.49–0.58]). Following SOFIA, there was improved prognostic accuracy for reviewers' definite UIP, probable UIP, and indeterminate scores (significant change in c-index), and the proportion of reviewers whose probable UIP scores were significantly predictive of transplant-free survival increased by 42%. Conclusion: Providing SOFIA algorithm output to clinicians reviewing HRCT scans improved diagnostic agreement and prognostic accuracy for fibrotic ILD. SOFIA may be a useful automated assistive tool to support improved diagnostic consistency.
2026
Deep-Learning Algorithm Diagnostic Support for Usual Interstitial Pneumonia Pattern Recognition in Fibrotic Interstitial Lung Disease / Fermoyle, C.C., Mackintosh, J.A., Navaratnam, V., Ellis, S.J., Cooper, W.A., Goh, N.S.L., Moodley, Y., Reynolds, P.N., Zappala, C.J., Hopkins, P., Glaspole, I.N., Corte, T.J., Walsh, S.L.F., Accornero, D., Agrawal, A., Akilli, I.K., Alamoudi, O., Alberti, M.L., Aliannejad, R., Aljahdali, H., et al.. - In: RESPIROLOGY. - ISSN 1323-7799. - (2026). [10.1002/resp.70246]
File in questo prodotto:
File Dimensione Formato  
Respirology .pdf

accesso aperto

Tipologia: Versione (PDF) editoriale
Licenza: Creative commons
Dimensione 3.33 MB
Formato Adobe PDF
3.33 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11381/3065221
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact