General Information

Code

17.05.126

Classification

006.312 - Data mining

Type

Scholarly Work - Thesis (Master's) - Reference

Subject

Data Mining

Views

269 times

Other Information

Abstract

Stemming is the process of finding the root word of a complex word form by removing all affixes attached to it. Stemming has been applied in text or document clustering, classification, summarization, information retrieval, and word-based text compression. Stemmers have been developed for various languages, including Indonesian, but Indonesian is one of the most complicated among them. Indonesian has complex affix forms: prefixes, infixes, suffixes, confixes, and repeated forms. In Indonesian, morphological changes occur when affixes, particularly prefixes, are attached to a root word. The first Indonesian stemmer was developed by Nazief and Adriani; Jelita Asian then improved the algorithm, producing the confix stripping (CS) stemmer. The CS stemmer introduced many improvements, making it the most accurate stemming algorithm, but stemming failures still occur. This work proposes a new algorithm that improves the CS stemmer by modifying it, specifically by rearranging the sequence of steps in the stemming process. An experiment was performed to compare the accuracy of the Nazief-Adriani stemmer, the CS stemmer, and the new algorithm by using each of them to stem words from three document sources: a novel, a hadith book, and online news. The stemming processes used a root word dictionary parsed from “Kamus Besar Bahasa Indonesia 2008”. The results of the experiment showed that the new algorithm has better accuracy than both the Nazief-Adriani and CS stemmers.
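
To make the idea of dictionary-based affix stripping concrete, here is a minimal Python sketch. It is not the Nazief-Adriani algorithm, the CS stemmer, or the algorithm proposed in the thesis. The tiny root-word set, the affix lists, and the order in which candidates are tried are all assumptions chosen for illustration; the thesis uses a dictionary parsed from “Kamus Besar Bahasa Indonesia 2008”.

```python
# Toy root-word dictionary (hypothetical; stands in for a full KBBI-derived list).
ROOT_WORDS = {"makan", "main", "baca", "ajar", "tulis"}

# Small, illustrative affix lists; longer affixes are listed first.
SUFFIXES = ("kan", "an", "i")
PREFIXES = ("meng", "mem", "men", "me", "ber", "ter", "di", "pe")

MIN_STEM_LENGTH = 3  # assumption: never strip down to fewer than 3 letters


def strip_suffix(word):
    """Remove the first matching suffix, or return the word unchanged."""
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM_LENGTH:
            return word[: -len(suffix)]
    return word


def strip_prefix(word):
    """Remove the first matching prefix, or return the word unchanged."""
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM_LENGTH:
            return word[len(prefix):]
    return word


def stem(word):
    """Return a dictionary root for the word, or the word itself on failure."""
    word = word.lower()
    # Checking the dictionary first avoids over-stemming roots such as
    # "makan", which happens to end in the suffix-like string "kan".
    if word in ROOT_WORDS:
        return word
    # Candidate order (suffix only, prefix only, both) is an arbitrary choice.
    without_suffix = strip_suffix(word)
    candidates = (without_suffix, strip_prefix(word), strip_prefix(without_suffix))
    for candidate in candidates:
        if candidate in ROOT_WORDS:
            return candidate
    return word  # stemming failure: no candidate is a known root


if __name__ == "__main__":
    for w in ("dimakan", "makanan", "membaca", "bermain", "menulis"):
        print(w, "->", stem(w))
```

The last word, "menulis", is left unstemmed by this sketch: the prefix changes the first letter of the root "tulis", which is the kind of morphological change the abstract describes and which plain affix stripping does not handle.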

  • CSG5E3 - DATA PREPARATION AND MINING
  • CSG5F3 - SPECIAL TOPICS IN DATA MINING A
  • CSG5H3 - SPECIAL TOPICS IN DATA MINING B
  • CSG643 - SPECIAL TOPICS IN ADVANCED DATA MINING A
  • CSG653 - SPECIAL TOPICS IN ADVANCED DATA MINING B
  • CSG6R3 - SPECIAL TOPICS IN APPLIED DATA MINING
  • CSG6S3 - SPECIAL TOPICS IN TEXT AND WEB MINING

Collection & Circulation

1 of 1 copies available

Author

Name HARI WIDAYANTO
Type Individual
Editor Arief Fatchul Huda
Translator

Publisher

Name Universitas Telkom
City Bandung
Year 2017

Circulation

Rental fee IDR 0,00
Daily fine IDR 0,00
Type Non-Circulating