Abstract :
Lung diseases remain a significant global health concern, necessitating the development of rapid and accurate diagnostic methods. While previous research has shown the promise of deep learning models, particularly transfer learning with architectures such as ResNet and VGG, limitations persist in evaluation scope, class imbalance handling, and model interpretability. This study proposes an enhanced deep learning framework for multi-label classification of thoracic diseases using chest X-ray images, addressing these gaps through comprehensive evaluation metrics, advanced data augmentation, and explainable AI (XAI) techniques. The NIH ChestX-ray14 dataset is utilized, with class imbalance mitigated via synthetic minority oversampling and weighted focal loss. Multiple state-of-the-art CNN architectures, including EfficientNet and ResNet variants, are benchmarked using precision, recall, F1 Score, AUC, and accuracy. Moreover, Gradient-weighted Class Activation Mapping (Grad-CAM) is integrated to visualize pathological regions, improving clinical interpretability. The offered framework can perform better in all assessment criteria, achieving an AUC of 0.91 with EfficientNet-B0, and provides interpretable outputs critical for deployment in real-world diagnostic settings. This work advances automated radiological diagnosis by addressing key methodological shortcomings and offers a reliable, explainable solution for lung disease detection.
Keywords :
Chest X-ray (CXR), Deep learning, EfficientNet, Grad- CAM, Lung Disease Classification, NIH ChestX-ray14, ResNet, Transfer learningReferences :
- V. Cottin et al., “Syndrome of Combined Pulmonary Fibrosis and Emphysema: An Official ATS/ERS/JRS/ALAT Research Statement,” Am J Respir Crit Care Med, vol. 206, no. 4, pp. e7–e41, Aug. 2022, doi: 10.1164/rccm.2022061041ST.
- C. F. Vogelmeier, M. Román-Rodríguez, D. Singh, M. K. Han, R. Rodríguez-Roisin, and G. T. Ferguson, “Goals of COPD treatment: Focus on symptoms and exacerbations,” Respir Med, vol. 166, p. 105938, May 2020, doi: 10.1016/j.rmed.2020.105938.
- V. E. Georgakopoulou, D. A. Spandidos, and A. Corlateanu, “Diagnostic tools in respiratory medicine,” Biomed Rep, vol. 23, no. 1, p. 112, 2025.
- J. C. Y. Seah et al., “Effect of a comprehensive deep-learning model on the accuracy of chest x-ray interpretation by radiologists: a retrospective, multireader multicase study,” Lancet Digit Health, vol. 3, no. 8, pp. e496–e506, Aug. 2021, doi: 10.1016/S2589-7500(21)00106-0.
- Md. R. Hasan, S. M. Azmat Ullah, and S. Md. Rabiul Islam, “Recent advancement of deep learning techniques for pneumonia prediction from chest X-ray image,” Medical Reports, vol. 7, p. 100106, Oct. 2024, doi: 10.1016/j.hmedic.2024.100106.
- A. Ait Nasser and M. A. Akhloufi, “A Review of Recent Advances in Deep Learning Models for Chest Disease Detection Using Radiography,” Diagnostics, vol. 13, no. 1, p. 159, Jan. 2023, doi: 10.3390/diagnostics13010159.
- A. Ebbehoj, M. Ø. Thunbo, O. E. Andersen, M. V. Glindtvad, and A. Hulman, “Transfer learning for non-image data in clinical research: A scoping review,” PLOS Digital Health, vol. 1, no. 2, p. e0000014, Feb. 2022, doi: 10.1371/journal.pdig.0000014.
- W. Xu, Y.-L. Fu, and D. Zhu, “ResNet and its application to medical image processing: Research progress and challenges,” Comput Methods Programs Biomed, vol. 240, p. 107660, Oct. 2023, doi: 10.1016/j.cmpb.2023.107660.
- W. Al-Khater and S. Al-Madeed, “Using 3D-VGG-16 and 3D-Resnet-18 deep learning models and FABEMD techniques in the detection of malware,” Alexandria Engineering Journal, vol. 89, pp. 39–52, Feb. 2024, doi: 10.1016/j.aej.2023.12.061.
- B. Koonce, “EfficientNet,” in Convolutional neural networks with swift for Tensorflow: image recognition and dataset categorization, Springer, 2021, pp. 109–123.
- K. S, R. S. Shudapreyaa, P. Prakash, V. S, V. V, and Y. S, “Classification of Lung Diseases Using Transfer Learning with Chest X-Ray Images,” in 2024 Second International Conference on Emerging Trends in Information Technology and Engineering (ICETITE), IEEE, Feb. 2024, pp. 1–6. doi: 10.1109/ic-ETITE58242.2024.10493367.
- T. Saito and M. Rehmsmeier, “The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets,” PLoS One, vol. 10, no. 3, p. e0118432, Mar. 2015, doi: 10.1371/journal.pone.0118432.
- R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, “Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization,” in 2017 IEEE International Conference on Computer Vision (ICCV), 2017, pp. 618–626. doi: 10.1109/ICCV.2017.74.
- R. Rabee, M. S. Jarjees, M. R. Aziz, and A. Asim Hameed, “Machine learning Techniques for Spondylolisthesis Diagnosis: a review,” NTU Journal of Engineering and Technology (NTU-JET), vol. 3, no. 2, Jul. 2024, doi: 10.56286/ntujet.v3i2.768.
- X. Wang, Y. Peng, L. Lu, Z. Lu, M. Bagheri, and R. M. Summers, “ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 3462–3471. doi: 10.1109/CVPR.2017.369.
- T. Saito and M. Rehmsmeier, “The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets,” PLoS One, vol. 10, no. 3, p. e0118432, Mar. 2015, doi: 10.1371/journal.pone.0118432.
- A. Kabiraj, T. Meena, P. B. Reddy, and S. Roy, “Detection and Classification of Lung Disease Using Deep Learning Architecture from X-ray Images,” 2022, pp. 444–455. doi: 10.1007/978-3-031-20713-6_34.
- A. Souid, N. Sakli, and H. Sakli, “Classification and Predictions of Lung Diseases from Chest X-rays Using MobileNet V2,” Applied Sciences, vol. 11, no. 6, 2021, doi: 10.3390/app11062751.
- A. Holzinger, G. Langs, H. Denk, K. Zatloukal, and H. Müller, “Causability and explainability of artificial intelligence in medicine,” WIREs Data Mining and Knowledge Discovery, vol. 9, no. 4, Jul. 2019, doi: 10.1002/widm.1312.
- M. Ghazal, “Parkinson’s Disease Detection Based on Transfer Learning,” NTU Journal of Engineering and Technology (NTU-JET), vol. 3, no. 3, Sep. 2024, doi: 10.56286/ntujet.v3i3.1173.
- X. Wang, Y. Peng, L. Lu, Z. Lu, M. Bagheri, and R. M. Summers, “ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases,” in 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, Jul. 2017, pp. 3462–3471. doi: 10.1109/CVPR.2017.369.
- N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, “SMOTE: Synthetic Minority Over-sampling Technique,” Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, Jun. 2002, doi: 10.1613/jair.953.
- T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal Loss for Dense Object Detection,” in 2017 IEEE International Conference on Computer Vision (ICCV), IEEE, Oct. 2017, pp. 2999–3007. doi: 10.1109/ICCV.2017.324.
- K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” 2015. [Online]. Available: https://arxiv.org/abs/1512.03385
- M. Tan and Q. V Le, “EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks,” 2020. [Online]. Available: https://arxiv.org/abs/1905.11946

