Social network approach for cyberbullying detection using machine learning

Authors

  • Anishah Muhammad Syafiq Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan Campus, Pekan 26600, Pahang, Malaysia
  • Mohd Faizal Ab Razak Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan Campus, Pekan 26600, Pahang, Malaysia
  • Ahmad Firdaus Zainal Abidin Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan Campus, Pekan 26600, Pahang, Malaysia
  • Salwana Mohamad@Asmara Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan Campus, Pekan 26600, Pahang, Malaysia
  • Nur Khairunnisa Kamaruddin Faculty of Computing, Universiti Malaysia Pahang Al-Sultan Abdullah, Pekan Campus, Pekan 26600, Pahang, Malaysia

DOI:

https://doi.org/10.15282/jgi.8.1.2025.11508

Keywords:

Fake news, Support vector machine, Naïve Bayes, Decision tree

Abstract

Cyberbullying is a serious issue that affects both adults and teenagers on the internet. Using social media and the internet is frequently associated with sending, receiving, and publishing derogatory, false, or unpleasant content about other people. This shows that cyberbullying has had a substantial negative impact on mental health, especially among the younger population. If action is not taken to stop cyberbullying, self-esteem and problems with mental health will impact a whole generation of young adults. Considering this, it is necessary to use machine learning (ML) approaches combined with natural language processing (NLP) and techniques like TF-IDF (Term Frequency-Inverse Document Frequency) to detect cyberbullying effectively. This study utilised a dataset that includes data on hate speech tweets. The research began by analysing the nature of cyberbullying and the challenges in its detection, underscoring the significance of automated methods. The study used NLP and TF-IDF to pre-process and analyse the dataset, identifying patterns and characteristics typical of cyberbullying behaviours. Subsequently, various ML techniques were utilised to accurately train models that can detect instances of cyberbullying in social media content. Specifically, the study has three primary goals: to create and implement an effective method for detecting online abusive and bullying messages by integrating NLP with ML; to evaluate the accuracy of the proposed detection algorithms for cyberbullying, specifically using Support Vector Machine (SVM), Naive Bayes (NB), and Random Forest (RF); and to compare the performance of these algorithms to identify the one with the highest accuracy in detecting text-based bullying. To sum up, this study highlights the potential of leveraging ML, NLP, and TF-IDF to address the escalating issue of cyberbullying on social media. The study advances the development of sophisticated detection algorithms by utilising a comprehensive dataset to emphasise the need for a multimodal approach that combines technological solutions with awareness-raising and education to foster a safer and more inclusive online community.

References

2020 IEEE International Conference on Big Data (Big Data). (2020). IEEE.

Al-Garadi, M. A., Hussain, M. R., Khan, N., Murtaza, G., Nweke, H. F., Ali, I., Mujtaba, G., Chiroma, H., Khattak, H. A., & Gani, A. (2019). Predicting cyberbullying on social media in the big data era using machine learning algorithms: Review of literature and open challenges. IEEE Access, 7, 70701–70718.

Ali, A., & Syed, A. M. (2020). Cyberbullying detection using machine learning. Pakistan Journal of Engineering and Technology, 3(2), 45-50.

Islam, M. M., Uddin, M. A., Islam, L., Akter, A., Sharmin, S., & Acharjee, U. K. (2020, December). Cyberbullying detection on social networks using machine learning approaches. In 2020 IEEE Asia-Pacific Conference on Computer Science and Data Engineering (CSDE) (pp. 1-6). IEEE.

Modem, S., Girish, H., Prathap, J. A., Ramakrishnappa, M., & Student, U. G. (2023). Faults identification in digital counter using Naive-Bayes’ algorithm. Tuijin Jishu/Journal of Propulsion Technology. 44(4), 1145-1151.

Muneer, A., & Fati, S. M. (2020). A comparative analysis of machine learning techniques for cyberbullying detection on twitter. Future Internet, 12(11), 187.

Perera, A., & Fernando, P. (2021). Accurate cyberbullying detection and prevention on social media. Procedia Computer Science, 181, 605-611.

Simões, R. D. M., Huber, P., Meier, N., Smailov, N., Füchslin, R. M., & Stockinger, K. (2023). Experimental evaluation of quantum machine learning algorithms. IEEE Access, 11, 6197-6208.

Pandian, A. P., Palanisamy, R., & Ntalianis, K. (Eds.). (2021). Proceedings of International Conference on Intelligent Computing, Information and Control Systems: ICICCS 2020. Springer.

Van Hee, C., Lefever, E., Verhoeven, B., Mennes, J., Desmet, B., De Pauw, G., ... & Hoste, V. (2015). Automatic detection and prevention of cyberbullying. In International Conference on Human and Social Analytics (HUSO 2015) (pp. 13-18). IARIA.

Xu, B., Guo, X., Ye, Y., & Cheng, J. (2012). An improved random forest classifier for text categorization. Journal of Computational Science, 7(12), 2913-2920.

Downloads

Published

2025-07-30

How to Cite

Muhammad Syafiq, A., Ab Razak, M. F., Zainal Abidin, A. F., Mohamad@Asmara, S., & Kamaruddin, N. K. (2025). Social network approach for cyberbullying detection using machine learning. Journal of Governance and Integrity, 8(1), 874-880. https://doi.org/10.15282/jgi.8.1.2025.11508

Similar Articles

1-10 of 20

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)