Raga classification using enhanced spatial bound whale optimization algorithm

Bettadamadahally Shivakumaraswamy Gowrishankar, Nagappa U. Bhajantri


A raga is a unique set of notes with certain rules that carefully followed, retain and protect its purity and produce amazing musical effects. An automated raga transcription and identification is important for computational musicology, which is an important step for musicology for indexing, classifying, and recommending tunes. In the present research, the audio features such as mel frequency cepstrum coefficients (MFCCs), spectral flux, short time energy, audio feature extractor, and spectral centroid features are used for the prediction of a raga. The model showed more complexity which means it required lots of training data. The proposed enhanced spatial bound whale optimization algorithm (ESBWOA) is used that overcome the feature selection problem of high dimensional features. In addition to this, a weighted salp swarm algorithm (SSA) is used for selecting the tone-based features from the ragas based on amplitude or each raga sample. The features were fed for bidirectional long short-term memory (Bi-LSTM) network, which enhanced the success rate for raga identification and classification. The present research uses CompMusic dataset in the research work where 9 classes for Carnatic music and 7 classes in Hindustani music are considered for the classification of ragas.


Audio feature extractor; Enhanced spatial bound whale optimization algorithm; Mel frequency cepstrum coefficients; Short time energy; Spectral centroid features spectral flux

Full Text:


DOI: http://doi.org/10.11591/ijeecs.v30.i2.pp825-837


  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

shopify stats IJEECS visitor statistics