Hydrophobicity signal analysis for robust SARS-CoV-2 classification
Abstract
Rapid and accurate classification of viral pathogens is critical for effective public health interventions. This study introduces a novel approach using convolutional neural networks (CNN) to classify SARS-CoV-2 and non-SARS-CoV-2 viruses via hydrophobicity signal derived from DNA sequences. Conventional machine learning methods grapple with the variability of viral genetic material, requiring fixed-length sequences and extensive preprocessing. The proposed method transforms genetic sequences into image-based representations, enabling CNNs to handle complexity and variability without these constraints. The dataset includes 8,143 DNA sequences from seven coronaviruses, translated into amino acid sequences and evaluated for hydrophobicity. Experimental results demonstrate that the CNN model achieves superior performance, with an accuracy of over 99.84% in the classification task. The model also performs well with extended sequence lengths, showcasing robustness and adaptability. Compared to previous studies, this method offers higher accuracy and computational efficiency, providing a reliable solution for rapid virus detection with potential applications in bioinformatics and clinical settings.
Keywords
Convolutional neural networks; Genetic sequencing; Kyte-Doolittle scale; Machine learning; Virus identification
Full Text:
PDFDOI: http://doi.org/10.11591/ijeecs.v37.i2.pp1294-1305
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).