EXTRACTION OF MALAY ROOT WORD THAT STARTS WITH LETTER P IN MALAY E-KHUTBAH USING RULE BASED

Zamri Abu Bakar; Nurhilyana Anuar; Normaly Kamal Ismail

doi:10.15282/ijsecs.9.1.2023.4.0108

Authors

Zamri Abu Bakar, Centre of Foundation Studies, Universiti Teknologi MARA Cawangan Selangor Kampus Dengkil
Nurhilyana Anuar, Centre of Foundation Studies, Universiti Teknologi MARA Cawangan Selangor Kampus Dengkil, 43800 Selangor, Malaysia
Normaly Kamal Ismail, Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA, 40450 Selangor, Malaysia

DOI:

https://doi.org/10.15282/ijsecs.9.1.2023.4.0108

Keywords:

Stemming, Affix, Rule-based approach, Natural language processing, Root word

Abstract

Stemming is an important process in text processing, especially in Natural Language Processing (NLP). It extracts the root word from affixed words in a text, and so helps in extracting useful information for many areas of research, such as Information Retrieval. Several stemming algorithms have been discussed in previous studies. However, studies on the Malay stemming process are limited, as is the amount of experimental data they use. In this study, we focus on the stemming process for Malay, applying a rule-based algorithm to a larger dataset of Malay-language text. The syntactic linguistic rule-based method used in the stemming process involves removing prefixes, suffixes, and combinations of prefixes and suffixes. The training dataset consisted of 3233 sentences from e-khutbah text. The experimental evaluation measured precision, recall and F-measure. The algorithm showed promising results relative to the total dataset used for each test: precision, recall and F-measure increased to 95%, 97% and 97%, respectively. The enhanced stemming process has a significant impact on Malay text processing, which in general improves the performance of NLP applications.
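
The rule-based idea described in the abstract (removing a prefix, a suffix, or both, then checking the result against known root words) can be sketched roughly as follows. This is a minimal illustration and not the authors' algorithm: the affix lists, the rule that restores a dropped initial "p" after "mem"/"pem", the tiny root list, and the names PREFIX_RULES, SUFFIXES, ROOTS, candidates, stem and precision_recall_f are all assumptions made for the example. How the paper maps stemming outcomes onto true/false positives is not stated here, so the metric function simply takes the counts as inputs.

```python
# Hypothetical rules and root list, for illustration only; not taken from the paper.
# Each prefix rule is (prefix, letters to restore at the start of the remainder).
PREFIX_RULES = [("mem", "p"), ("pem", "p"), ("mem", ""), ("pem", ""),
                ("per", ""), ("me", ""), ("pe", ""), ("di", "")]
SUFFIXES = ["kan", "an", "i"]
ROOTS = {"pakai", "pilih", "pimpin", "pukul", "baca"}


def candidates(word):
    """Yield possible roots: prefix removed, suffix removed, then both removed."""
    prefix_stripped = [restore + word[len(prefix):]
                       for prefix, restore in PREFIX_RULES
                       if word.startswith(prefix)]
    yield from prefix_stripped
    for suffix in SUFFIXES:
        if word.endswith(suffix):
            yield word[:-len(suffix)]
    for partial in prefix_stripped:
        for suffix in SUFFIXES:
            if partial.endswith(suffix):
                yield partial[:-len(suffix)]


def stem(word, roots=ROOTS):
    """Return the first candidate found in the root list, else the word unchanged."""
    word = word.lower()
    if word in roots:
        return word
    for candidate in candidates(word):
        if candidate in roots:
            return candidate
    return word


def precision_recall_f(tp, fp, fn):
    """Standard precision, recall and F-measure from raw counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


if __name__ == "__main__":
    for w in ["pakaian", "memakai", "pemimpin", "pemilihan", "masjid"]:
        print(w, "->", stem(w))
```

Checking each candidate against a root list keeps the sketch from over-stemming words that merely look affixed; a word with no matching candidate, such as "masjid" here, is returned unchanged.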


