Abusive Language Detection on Indonesian Online News Comments

DHAMIR RANIAH KIASATI DESRUL

Informasi Dasar

20.04.983
006.3
Karya Ilmiah - Skripsi (S1) - Reference

Abusive language is an expression used by a person with insulting delivery of any person’s aspect. In the modern era, the use of harsh words is often found on the internet, one of them is in the comment section of online news articles which contains harassment, insult, or a curse. An abusive language detection system is important to prevent the negative effect of such comments. Detecting abusive language in the online comment section is a challenge since abusive languages can be expressed in various words. Moreover, only a few studies have been conducted in Indonesian language. In this paper, we present an Indonesian abusive language detection system by tackling this problem as a classification task and solving it using the following classifiers: Naive Bayes, SVM, and KNN. We also performed feature selection procedure based on Mutual Information value between words. The experimental results show that SVM is the best classifier for detecting the abusive language in news comment with an accuracy score of 90,19% and the use of Mutual Information able to improve the classification accuracy by 1.63%. Mutual Information can increase the accuracy performance of the classifier.

Subjek

Natural language processing
 

Katalog

Abusive Language Detection on Indonesian Online News Comments
 
 
Indonesia

Sirkulasi

Rp. 0
Rp. 0
Tidak

Pengarang

DHAMIR RANIAH KIASATI DESRUL
Perorangan
ADE ROMADHONY, NIKEN DWI WAHYU
 

Penerbit

Universitas Telkom, S1 Informatika
Bandung
2020

Koleksi

Kompetensi

  • CIG4A3 - PEMBELAJARAN MESIN
  • CSH4O3 - PEMROSESAN BAHASA ALAMI
  • CSH4G3 - PENAMBANGAN DATA
  • CSH4H3 - PENAMBANGAN TEKS
  • CII4G3 - PEMROSESAN BAHASA ALAMI
  • CII4I3 - PENAMBANGAN DATA
  • CPI4I3 - PENAMBANGAN DATA

Download / Flippingbook

 

Ulasan

Belum ada ulasan yang diberikan
anda harus sign-in untuk memberikan ulasan ke katalog ini