Development of natural language processing on morphologybased Minangkabau language stemming algorithm

Rini Sovia, Sarjon Defit, Yuhandri Yuhandri, Sulastri Sulastri

Abstract


Minangkabau language (ML) is one of the daily communication tools used by the people of West Sumatra, Indonesia. ML is a challenge in communicating. The ML language translation process is necessary to facilitate communication. This study aims to build a translation system for ML into Indonesian by developing the concept of natural language processing (NLP). NLP development adopts the performance of morphology-based Minangkabau language stemming algorithm (MLSA) which can separate basic words with affixes and endings. The research dataset adopts 600 basic ML words sourced from the big Minangkabau dictionary. The results of this study provide analytic output that can translate ML into Indonesian well. These results are presented based on the testing process on basic word input with an accuracy rate of 97.16% and based on text documents of 91.65%. Thus, the MLSA performance process presents the accuracy of the translation process. Based on these results, this research contributes to developing a stemming algorithm model in carrying out the process of removing prefixes, inserts, and suffixes in the Minangkabau language. Overall, this research can be useful as a tool for translating the ML into Indonesian.

Keywords


Minangkabau language; Natural language processing; Minangkabau language; Stemming algorithm; Stemming algorithm translation

Full Text:

PDF


DOI: http://doi.org/10.11591/ijeecs.v31.i1.pp542-552

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Indonesian Journal of Electrical Engineering and Computer Science (IJEECS)
p-ISSN: 2502-4752, e-ISSN: 2502-4760
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

shopify stats IJEECS visitor statistics