Informasi Umum

Kode

17.04.1476

Klasifikasi

006.454 - Speech Recognition

Jenis

Karya Ilmiah - Skripsi (S1) - Reference

Subjek

Speech Recognition

Dilihat

10 kali

Informasi Lainnya

Abstraksi

This essay discusses about speaker recognition, a system that recognize subject identity by their voice. First, subject’s voice features are extracted using MFCC (Mel-Frequency Cepstrum Coefficients) method. Steps in the MFCC are pre-emphasis, framing, windowing, FFT (Fast Fourier Transform), mel scaling and DCT (Discrete Cosine Transform), which produce feature vector called cepstrums. These cepstrums are then modelled using GMM (Gaussian Mixture Model). Steps in th GMM are Expectation-step and Maximization-step, which produce gaussian distribution along with its parameters, mean (µ) and variance (?^2) which are different for every subjects. Classification step is done by comparing between training data parameters and testing data parameters. If the comparation gets high score, it means two datas are match, vice versa. Previous research done by student group at Preston University and Jinnah Women University, Pakistan, with title “Speaker Identification Using GMM with MFCC” gets accuration score 87,5% using feature extraction method MFCC, clustering method K-Means, modelling method GMM and classification by log probability. In this essay, we will pass the clustering step and classify by doing comparation between gaussian distribution using parameter mean (µ) and variance (?^2), which is the fastest and easiest way. In this essay, we do the best thing to get as close as possible with previous research accuration score knowing that the classification step can be called the ‘rough’ one in GMM usage so we are not expecting high, even though there are so many factors that can influence the simulation accuracy. Keywords : MFCC, GMM, Speaker Recognition

  • TT3113 - PENGOLAHAN SINYAL DIGITAL
  • TTG4K3 - PENGOLAHAN SINYAL BICARA
  • TTH4O3 - SPEECH PROCESSING

Koleksi & Sirkulasi

Tersedia 1 dari total 1 Koleksi

Anda harus log in untuk mengakses flippingbook

Pengarang

Nama DOANDA KHABI PUTRA
Jenis Perorangan
Penyunting IWAN IWUT, RATRI DWI ATMAJA
Penerjemah

Penerbit

Nama Universitas Telkom, S1 Teknik Telekomunikasi
Kota Bandung
Tahun 2017

Sirkulasi

Harga sewa IDR 0,00
Denda harian IDR 0,00
Jenis Non-Sirkulasi