Robust spoken word detection in assamese language using BiLSTM with data augmentation for noisy environments

Deepjyoti Kalita, Khurshid Alam Borbora

Abstract


This study focuses on enhancing spoken word detection in the Assamese language using bidirectional long shor term memory (BiLSTM). The primary objective is to improve the model’s robustness in noisy environments by using various data augmentation methods. The research addresses the challenges of keyword detection in low-resource languages like Assamese. A BiLSTM model was trained and tested using a speech corpus sourced from the Indian Language Technology Proliferation and Development Center (ILTP-DC), comprising 32,335 utterances from 1,000 speakers and 262 unique Assamese words. The model was trained on 10 specific keywords. Feature extraction was conducted using 39 coefficients, including MFCC, ΔMFCC, and ΔΔMFCC. The model’s performance was evaluated on clean and augmented noisy datasets. The application of data augmentation techniques significantly improved the model’s performance in noisy environments. This model achieved an average accuracy of 98.01% and a word error rate (WER) of 19.94% on noisy data, showcasing the effectiveness of augmentation in enhancing keyword detection. This work introduces a novel approach to Assamese spoken word detection by integrating BiLSTM with data augmentation techniques, making the model more noise-resilient. This study sets a benchmark for Assamese speech recognition and showcases augmentation techniques’ effectiveness in low-resource languages.

Keywords


BiLSTM; Data augmentation; Deep learning; Keyword detection; Machine learning; MFCC; WER

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v40.i1.pp263-270

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES).

shopify stats IJEECS visitor statistics