Efficient lung disease detection using a hybrid vision transformer and YOLO framework with transfer learning

Kashaf Khan, Abdul Aleem

Abstract


Lung diseasesĀ are among the most important causes of morbidity and mortality worldwide; it require prompt and accurate diagnosis methods. A novel hybrid deep learning framework for integrating you only look once version 8 (YOLOv8), considering real-time detection and vision transformer (ViT-B/16) for global context-based classification of lung diseases in chest X-ray images, is presented. Based on transfer learning and a two-stage detection-classification pipeline, this proposed model is applicable to dealing with inter-image variability, overlapped disease features and lack of annotated medical examples. Our developed hybrid model achieves the highest classification accuracy of 96.8% and 0.98 AUC-ROC on the National Institutes of Health (NIH) Chest X-ray dataset, which consists of over 112,000 images covering 14 diseases, and outperforms its several current state-of-the-art models. In addition, attention heatmaps and bounding box visualizations highly correlate with clinical variables and enhance interpretability. This paper demonstrates the practicability of hybrid vision driven architectures for better medical image analysis and shows their integration into clinical decision-support systems.


Keywords


Chest X-ray analysis; Deep learning; Hybrid model; Lung disease detection; Transfer learning; Vision transformer; YOLO framework;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v40.i2.pp1141-1148

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

shopify stats IJEECS visitor statistics