Studi Perbandingan Penggabungan Metode Pemilihan Fitur dengan Metode Klasifikasi dalam Klasifikasi Teks

Genta Sahuri

Abstract


The main purposes of this comparative study is to obtain the best features and the method of selecting the most suitable for a particular classification method, as well as provides an overview of the performance and the accuracy of each selection features when combined with any method of classification.  From the experiment it shows that for Naive Bayes classification method has the maximum degree of accuracy when combined with feature selection using Support Vector Machine. K-Nearest Neighbor classification obtains maximum accuracy when it is combined with feature selection using Information Gain and Uncertainty, with the value of k is 4. Furthermore, for Neural Network classifier, it looks less when it is combined with the feature selection tested since it is only produce maximum accuracy less than 50% combined with Information Gain. Moreover, Support Vector Machine resulting maximum accuracy when it is tested using Information Gain, Chi Squared, Deviation and SVM.

Full Text:

PDF

References


Asti Tika Pratiwi, Achmad Ridok, Indriati, Klasifikasi Tema pada Lirik Lagu dengan Metode Transformed Weight-Normalized Complement Naïve Bayes (TWCNB), Repositori Jurnal Mahasiswa PTIIK UB, Vol. 3 No. 7, 2014.

Ramadan, Riza “Penerapan Pohon Untuk Klasifikasi Dokumen Teks Berbahasa Inggris”, Sekolah Teknik Elektro dan Informatika, Institut Teknologi Bandung.

Yang, Y., Pedersen J.P.: A Comparative Study on Feature Selection in Text Categorization (pdf) Proceedings of the Fourteenth International Conference on Machine Learning (ICML'97), 412-420, 1997.

Robert Neumayer, Rudolf Mayer, Kjetil Nørvåg: Combination of Feature Selection Methods for Text Categorisation, 33rd European Conference on IR Research, ECIR 2011, Dublin, Ireland, April 18-21, 2011. Proceedings, pp 763-766, ISBN: 978-3-642-20160-8, 2011.


Refbacks

  • There are currently no refbacks.