Deteksi Hate Speech pada Unggahan Media Sosial dengan Naive Bayes Menggunakan Seleksi Fitur Chi-Square

Authors

  • Putu Steven Belva Chan Universitas Udayana Author
  • Ida Ayu Gde Suwiprabayanti Putra Universitas Udayana Author

DOI:

https://doi.org/10.24843/JNATIA.2024.v03.i01.p20

Keywords:

Hate Speech, Naive Bayes, TF-IDF, Chi-square

Abstract

In the digital age, social media's pervasive use has revolutionized global communication but also introduced challenges like hate speech. This study proposes a Multinomial Naive Bayes model optimized with Chi-square feature selection to detect hate speech efficiently from large-scale social media data. Leveraging machine learning, this approach aims to combat harmful content by identifying relevant text features crucial for distinguishing hate speech from non-hate speech. The study utilizes TF-IDF for feature extraction and Chi-square for feature selection, showing significant performance improvements in hate speech detection. The Chi-square feature selection model yielded average precision, recall, F1-score, and accuracy values of 92%, 92%, 91%, and 92% respectively. In contrast, the model without feature selection achieved values of 89%, 89%, 88%, and 89% for the same metrics. Results demonstrate enhanced accuracy, precision, recall, and F1-score across various hate speech categories. 

Downloads

Published

2024-11-01

How to Cite

[1]
Putu Steven Belva Chan and Ida Ayu Gde Suwiprabayanti Putra, “Deteksi Hate Speech pada Unggahan Media Sosial dengan Naive Bayes Menggunakan Seleksi Fitur Chi-Square”, Jnatia, vol. 3, no. 1, pp. 169–176, Nov. 2024, doi: 10.24843/JNATIA.2024.v03.i01.p20.

Most read articles by the same author(s)

Similar Articles

1-10 of 68

You may also start an advanced similarity search for this article.