Improving Kui digit recognition through machine learning and data augmentation techniques

Subrat Kumar Nayak, Ajit Kumar Nayak, Smitaprava Mishra, Prithviraj Mohanty, Nrusingha Tripathy, Sashikanta Prusty

Abstract


Speech digit recognition research is growing decisively, and a bulk of digit recognition algorithms are used in European and a few Asian languages. Kui is a low-resourced tribal language locally used in several states of India. Despite its significance, there is not much research on Kui's speech. This research aims to present an in-depth analysis of novel Kui digit recognition using predefined machine learning (ML) techniques. For this purpose, we first gathered spoken numbers i.e. from 0 to 9 of eight different speakers containing a total of 200 words. Secondly, we choose the numbers: ଶୂନ (zero), ଏକ (one), ଦୁଇ (two), ତିନି(three), ସାରି(four), ପାସ (five), ସଅ (six), ସାତ (seven), ଆଟ (eight), ନଅ (nine). Meanwhile, we build nine different ML models to recognize Kui digits that take the Mel-frequency cepstral coefficients (MFCCs) method to extract the relevant features for model predictions. Finally, we compared the performance of ML models for both augmented and non-augmented Kui data. The result shows that the SVM+Augmentation method for Kui digit recognition combined obtained the highest accuracy of 83% than other methods. Moreover, the difficulties and potential prospects for Kui digit recognition are also highlighted in this work.

Keywords


Data augmentation; Kui dataset; Low resource language; Mel-frequency cepstral coefficients; Speech recognition

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v35.i2.pp867-877

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

The Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics