Informasi Umum

Kode

22.05.029

Klasifikasi

006.35 - Natural Language Processing, Computer Science

Jenis

Karya Ilmiah - Thesis (S2) - Reference

Subjek

Natural Language Processing

Dilihat

369 kali

Informasi Lainnya

Abstraksi

Arabic is a language with rich morphology and has problem where a lexical item can appear as a form with highly inflected forms in the corpus. This large variation can reduce the possibility of finding a single word form and reduce the effectiveness of other tasks in NLP (Natural Language Processing). Therefore, to handling the problem of finding single word form, this study aims to propose a model that can accurately perform word formation in Arabic. The model build using morphological reinflection techniques with an emphasis on the important elements of word formation in Arabic. These elements are type of words, wazan (verb-form), and dhamir (pronouns). The elements represented by morphological features namely MSD (Morphosyntactic Description). Previous research has success build a model for the reinflection process without wazan. In this study wazan is an additional feature and an important part in increasing accuracy. The model was built using character-based RNN (Recurrent Neural Network) seq2seq. This model successfully to map words correctly with 92.87% accuracy for task with MSD-source and 90.71% accuracy for task without the MSD-source. These accuracies are 1.78% and 7.91% higher than previous research. It means that this study produces more precise predictions.

Keywords: Arabic, single word form, morphological reinflection, morphosyntactic description, RNN seq2seq.

Koleksi & Sirkulasi

Seluruh (1) koleksi tidak tersedia

Anda harus log in untuk mengakses flippingbook

Pengarang

Nama LARAS GUPITASARI
Jenis Perorangan
Penyunting MOCH. ARIF BIJAKSANA
Penerjemah

Penerbit

Nama Universitas Telkom, S2 Informatika
Kota Bandung
Tahun 2022

Sirkulasi

Harga sewa IDR 0,00
Denda harian IDR 0,00
Jenis Non-Sirkulasi