ON THE FEATURE SELECTION AND CLASSIFICATION BASED ON INFORMATION GAIN FOR DOCUMENT-LEVEL SENTIMENT ANALYSIS

ASRIYANTI INDAH PRATIWI

Informasi Dasar

17.05.122
006.312
Karya Ilmiah - Thesis (S2) - Reference

Film and movies are part of digital lifestyle. A numerous video streaming platforms appear as consequence. Movie review is an alternative for choosing movies which are going to watch next. Unfortunately, the movie review may contain spoiler which is not good for movie bu?s. Actually, they only need to know the sentiment information that occurs in movie reviews. Extracting sentiment information in movie review can be done by sentiment analysis. Sentiment analysis, also known as sentiment categorization, is a study that analyses the subjective information on a speci?c object. Unfortunately, the bag of word representation used in sentiment analysis has problems in handling high dimensional feature matrix. This problem can be handled by selecting the feature by feature selection. A good feature is the one that has high relevance to the output class. Current work in feature selection for sentiment analysis succeeds in capturing highly relevant feature, but the occurrence of selected features is rare. As a result, sentiment analysis is facing an over-?tting problem. Information Gain is a common scoring method to select feature. In this study, a feature selection and classi?cation based on Information Gain is proposed. The proposed feature selection, IGDFFS, selects features which satisfy two criteria: (1) high relevance to the output class and (2) high occurrence. The Information Gain score is also used to build a polarity dictionary. The proposed classi?cation scheme, IGC, uses a dictionary to classify the movie review.The experiment result shows that the combination of IGDFFS and IGC, whose accuracy achieves 96%, is more e?ective than the other methods proposed in the previous work.

Subjek

Text mining
 

Katalog

ON THE FEATURE SELECTION AND CLASSIFICATION BASED ON INFORMATION GAIN FOR DOCUMENT-LEVEL SENTIMENT ANALYSIS
 
 
 

Sirkulasi

Rp. 0
Rp. 0
Tidak

Pengarang

ASRIYANTI INDAH PRATIWI
Perorangan
Adiwijaya
 

Penerbit

Universitas Telkom
Bogor
2017

Koleksi

Kompetensi

  • CSG5E3 - PERSIAPAN DAN PENAMBANGAN DATA
  • CSG503 - PEMODELAN DAN OPTIMASI
  • CSG523 - ANALISIS ALGORITMA
  • CSG533 - TEORI INFORMASI
  • CSG553 - SISTEM CERDAS LANJUT
  • CSG5G3 - DATA MINING LANJUT
  • CSG6Q3 - TOPIK KHUSUS DALAM NUMERICAL MACHINE LEARNING
  • CSH6G3 - ANALISIS DAN PENAMBANGAN TEKS
  • IEH5C3 - METODE OPTIMASI
  • CII7E3 - ANALISIS DAN PENAMBANGAN TEKS
  • IMI1C3 - METODE OPTIMASI
  • CII7E3 - ANALISIS DAN PENAMBANGAN TEKS

Download / Flippingbook

 

Ulasan

Belum ada ulasan yang diberikan
anda harus sign-in untuk memberikan ulasan ke katalog ini