Klasifikasi Genre Buku Berdasarkan Sinopsis Menggunakan Naïve Bayes dan Logistic Regression

Authors

  • Anak Agung Anom Witaradiani Universitas Udayana Author
  • I Gede Arta Wibawa Universitas Udayana Author
  • Putu Praba Santika Universitas Udayana Author

DOI:

https://doi.org/10.24843/JNATIA.2025.v03.i04.p13

Keywords:

Book Genre, Text Classification, Naïve Bayes, Logistic Regression, Confusion Matrix

Abstract

Genre is an important element in book categorization based on specific content characteristics or themes. However, manual classification processes are no longer efficient due to the increasing volume of literature. This study aims to compare the performance of Naïve Bayes and Logistic Regression algorithms in book genre classification based on synopses. The dataset used is secondary data obtained from Kaggle. The dataset consists of 4,535 samples after the preprocessing stage, with feature representation using the TF-IDF method. To address class distribution imbalance, the Synthetic Minority Over-sampling Technique (SMOTE) was applied. The evaluation was conducted using accuracy, precision, recall, and F1-score metrics. The experimental results show that Logistic Regression achieved the best performance with 75.19% accuracy and 75.16% F1-score, while Naïve Bayes achieved 72.22% accuracy and 72.11% F1-score. Based on this evaluation, Logistic Regression is considered more effective in classifying book genres from synopsis text.

Downloads

Published

2025-08-01

How to Cite

[1]
Anak Agung Anom Witaradiani, I Gede Arta Wibawa, and Putu Praba Santika, “Klasifikasi Genre Buku Berdasarkan Sinopsis Menggunakan Naïve Bayes dan Logistic Regression”, Jnatia, vol. 3, no. 4, pp. 835–844, Aug. 2025, doi: 10.24843/JNATIA.2025.v03.i04.p13.

Similar Articles

1-10 of 15

You may also start an advanced similarity search for this article.