QV finder: an accurate Quran verse finder system

Bashar Al-Rfooh, Mohammad Abdel-Majeed, Shorouq Al-Awawdeh, Khaled A. Drabkeh

Abstract


A voice-based search is becoming increasingly important for accessing information across various domains. One of the most challenging areas is Quranic verse search, where precise recitation rules (Tajweed), dialectal variations, and background noise affect accuracy. In this work, we present QV finder, an artificial intelligence (AI)-powered system that utilizes a finetuned whisper-based automatic speech recognition (ASR) model specifically trained on diverse Quranic recitations for the whole Quran. In this paper, we present a robust pipeline for Quranic verse retrieval that bridges the gap between ASR technology and domain-specific linguistic complexity. The model supports both professional and normal reciters, even under noisy conditions. To enhance the localization of verses from partial recitations, we integrated tokenization and advanced string-matching algorithms such as Levenshtein distance and FuzzyWuzzy. For normal reciters, the proposed model achieves a word error rate (WER) of 10.1% and character error rate (CER) of 3.3%, outperforming Google ASR, which exhibits a WER of 27.04%, and a CER of 7.13%. The model also achieves 100% verse retrieval accuracy with a 2.5% false positive rate. Our best fine-tuned model is uploaded here: https://huggingface.co/basharalrfooh/whisper-small-quran.

Keywords


Arabic speech recognition; Automatic speech recognition; Speech recognition; String matching algorithms; Whisper

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v40.i3.pp1486-1499

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

shopify stats IJEECS visitor statistics