ANALISIS PERBANDINGAN ALGORITMA NAIVE BAYES DAN SUPPORT VECTOR MACHINE DALAM MENGKLASIFIKASIKAN JUMLAH PEMBACA ARTIKEL ONLINE
DOI:
https://doi.org/10.31000/.v2i2.1521Abstract
PT. Linktone Indonesia merupakan salah satu perusahaan yang bergerak dalam bidang portal berita online. Semakin banyaknya portal berita online di Indonesia, para penulis yang ada di PT. Linktone Indonesia harus dapat bersaing, agar artikel yang mereka publish mendapatkan jumlah pembaca yang maksimal. Jumlah pembaca pada sebuah artikel tidaklah menentu, dan sulit untuk diprediksi. Banyaknya jumlah artikel yang dimiliki, maka dapat dilakukan penelitian data mining untuk mengklasifikasi jumlah pembaca artikel. Terdapat beberapa algoritma dalam teknik klasifikasi, akan tetapi tidak semua algoritma memiliki kinerja dan tingkat keakuratan yang baik dalam mengklasifikasi jumlah pembaca artikel. Penelitian ini membandingkan dua algoritma klasifikasi antara Naive Bayes, Support Vector Machine dan Bagging pada tiap algoritma. Peneliti membagi menjadi 5 dataset dan menggunakan tools WEKA dengan tools options K-Folds Cross Validation dan Confussion Matrix. Hasil penelitian ini, dengan jumlah dataset 7111 record. Bagging kurang memperbaiki hasil klasifikasi dengan jumlah dataset yang besar dan memerlukan waktu pembuatan model yang sangat lama dengan klasifikasi Support Vector Machine. Sementara itu Naive Bayes dalam segi waktu pembuatan model mendapatkan waktu yang paling cepat.
References
Sartika, D. dan Indra, D. (2017) ‘Perbandingan
Algoritma Klasifikasi Naive Bayes , Nearest
Neighbour , dan Decision Tree pada Studi
Kasus Pengambilan Keputusan Pemilihan Pola
Pakaian’, Jurnal Teknik Informatika dan Sistem
Informasi (JATISI), 1(2), pp. 151–161.
Ariadi, D. dan Fithriasari, K. (2015)
‘Klasifikasi Berita Indonesia Menggunakan
Metode Naive Bayesian Classification dan
Support Vector Machine dengan Confix
Stripping Stemmer’, JURNAL SAINS DAN
SENI ITS Vol. 4, No.2, 4(2), pp. 248–253.
Pratiwi, O. N. (2016) ‘Analisa Perbandingan
Algoritma K-Means, Decision Tree, Dan
Naive Bayes Untuk Sistem Pengelompokkan
Siswa Otomatis’, II(2).
Tu, M. C., Shin, D. and Shin, D. (2009) ‘A
Comparative Study of Medical Data
Classification Methods Based on Decision
Tree and Bagging Algorithms’, in 2009 Eighth
IEEE International Conference on
Dependable, Autonomic and Secure
Computing, pp. 183–187. doi:
1109/DASC.2009.40
Witten, I. H., Frank, E. and Hall, M. A. (2011)
‘Tutorial Exercises for the Weka Explorer’, in
Data Mining: Practical Machine Learning Tools
and Techniques, pp. 559–585. doi:
1016/B978-0-12-374856-0.00017-1
Jiawei Han, M. K. A. J. P. (2012) ‘Data Mining:
Concepts and Techniques, Third Edition –
Books24x7’, Morgan Kaufmann Publishers, p.
doi: 10.1002/1521-3773(20010316)40:6
MacLennan, J. and Tang, Z. H. (2011) Data
mining with Microsoft SQL server 2008,
Cancer gene therapy. doi: 10.1038/cgt.2011.47.
Hoffer, J. A., Ramesh, V. and Topi, H. (2015)
Modern Database Management, Modern
Database Management. doi:
1017/CBO9781107415324.004
Cristianini, N. and Shawe-Taylor, J. (2000) ‘An
Introduction to Support Vector Machines and
other kernel based learning methods’, Ai
Magazine, p. 190. doi: citeulike-article-
id:114719
Breiman, L. (1994) Bagging predictors:
Technical Report No. 421, Machine Learning
Liang, G., Zhu, X. and Zhang, C. (2014) ‘The
effect of varying levels of class distribution on
bagging for different algorithms: An empirical
study’, International Journal of Machine
Learning and Cybernetics, 5(1), pp. 63–71. doi:
1007/s13042-012-0125-5
Alfaro, E., Gamez, M. and GarcÃa, N. (2013)
‘adabag : An R Package for Classification with
Boosting and Bagging’, Journal of Statistical
Software, 54(2), pp. 1–35. doi:
http://dx.doi.org/10.18637/jss.v054.i02
Oktafia, D. and Pardede, D. D. L. C. (2008)
‘Perbandingan Kinerja Algoritma Decission
Tree Dan Naive Bayes Dalam Memprediksi
Kebangkrutan’, 2008, p. 2008.
Duriqi, R., Raca, V. and Cico, B. (2016)
‘Comparative Analysis of Classification
Algorithms on Three Different Datasets using
WEKA’, Embedded Computing (MECO),
5th Mediterranean Conference on.
IEEE, pp. 96–101
Downloads
Published
Issue
Section
License
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- That it is not under consideration for publication elsewhere,
- That its publication has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Copyright
Authors who publish with International Journal of Advances in Intelligent Informatics agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.Â
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
Licensing for Data Publication
International Journal of Advances in Intelligent Informatics use a variety of waivers and licenses, that are specifically designed for and appropriate for the treatment of data:
Open Data Commons Attribution License, http://www.opendatacommons.org/licenses/by/1.0/ (default)
Creative Commons CC-Zero Waiver, http://creativecommons.org/publicdomain/zero/1.0/
Open Data Commons Public Domain Dedication and Licence, http://www.opendatacommons.org/licenses/pddl/1-0/
Other data publishing licenses may be allowed as exceptions (subject to approval by the editor on a case-by-case basis) and should be justified with a written statement from the author, which will be published with the article.
Open Data and Software Publishing and Sharing
The journal strives to maximize the replicability of the research published in it. Authors are thus required to share all data, code or protocols underlying the research reported in their articles. Exceptions are permitted but have to be justified in a written public statement accompanying the article.
Datasets and software should be deposited and permanently archived inappropriate, trusted, general, or domain-specific repositories (please consult http://service.re3data.org and/or software repositories such as GitHub, GitLab, Bioinformatics.org, or equivalent). The associated persistent identifiers (e.g. DOI, or others) of the dataset(s) must be included in the data or software resources section of the article. Reference(s) to datasets and software should also be included in the reference list of the article with DOIs (where available). Where no domain-specific data repository exists, authors should deposit their datasets in a general repository such as ZENODO, Dryad, Dataverse, or others.
Small data may also be published as data files or packages supplementary to a research article, however, the authors should prefer in all cases a deposition in data repositories.