IMPLEMENTASI WEB SCRAPING PADA SITUS JURNAL SINTA MENGGUNAKAN FRAMEWORK SELENIUM WEBDRIVER PYTHON
DOI:
https://doi.org/10.31000/jika.v7i1.7037Abstrak
Pencarian referensi artikel ilmiah merupakan tahap awal dari perancangan sebuah penelitian. Para peneliti seringkali memulai dengan mencari berbagai macam referensi yang memiliki topik dan tujuan yang sama dengan apa yang diteliti. Artikel ilmiah sangat mudah ditemukan melalui internet. Hanya saja dengan banyaknya jumlah artikel ilmiah, proses pencarian akan memakan waktu yang cukup lama. Hal ini dikarenakan beberapa artikel yang sudah terhapus atau membutuhkan akses lebih untuk membukanya. Maka dari itu, untuk mempercepat para mahasiswa dan peneliti mendapatkan artikel ilmiah yang sesuai dengan topik penelitian, diperlukan sebuah laman web khusus yang mampu mengumpulkan beberapa informasi mengenai artikel ilmiah. Penelitian ini bertujuan membuat sebuah laman web yang berisi informasi khusus mengenai referensi artikel ilmiah yang dapat membantu mahasiswa Sekolah Tinggi Analis Bakti Asih dan para peneliti untuk mempercepat pencarian referensi artikel ilmiah. Perancangan sistem ini menggunakan metode web scraping untuk pengambilan data artikel ilmiah, yang menggunakan bahasa pemrograman Python dengan menggunakan library selenium, library pandas, library BeautifulSoup dan library openpyxl. Hasil data scraping yang berupa file excel yang dibuatkan sebuah laman web pada web siakad Sekolah Tinggi Analis Bakti Asih Bandung yang menggunakan Framework CodeIgniter dengan bahasa pemrograman Php. Pengembangan sistem menggunakan metode SDLC (Waterfall) dan pengujian sistem menggunakan metode pengujian blackbox.Referensi
Adli, M. A., & Firgia, L. (2018). Rancang Bangun Web Scraping Pada Media Online Berita Nasional. Jurnal ENTER : Jurnal Online Mahasiswa Program Studi Teknik Informatika STMIK PONTIANAK, 1, 118–128.
Djufri, M. (2020). Penerapan Teknik Web Scraping Untuk Penggalian Potensi Pajak (Studi Kasus Pada Online Market Place Tokopedia, Shopee Dan Bukalapak). Jurnal BPPK : Badan Pendidikan Dan Pelatihan Keuangan, 13(2), 65–75. https://doi.org/10.48108/jurnalbppk.v13i2.636
Flores, V. A., Permatasari, P. A., & Jasa, L. (2020). Penerapan Web Scraping Sebagai Media Pencarian dan Menyimpan Artikel Ilmiah Secara Otomatis Berdasarkan Keyword. Majalah Ilmiah Teknologi Elektro, 19(2), 157. https://doi.org/10.24843/mite.2020.v19i02.p06
Mufidah, U. (2018). Perancangan Aplikasi Perbandingan Harga Produk (Historical Data) Menggunakan Teknik Web Scraping. Skripsi, 1(1), 1–14.
Purnomo, L. M., & Ayub, M. (2021). Analisis data hasil web scraping untuk menentukan kualitas jurnal ilmiah. Jurnal STRATEGI-Jurnal Maranatha, 3(1), 122–132. Retrieved from http://strategi.it.maranatha.edu/index.php/strategi/article/view/237
Rohim, A. A., & Rachman, R. (2022). Sistem Informasi Pemesanan Makanan Minuman berbasis Android Hybrid Sun and Grass Coffee, 3(1), 94–104.
Rohim, M. A. (2018). Digital Digital Repository IMPLEMENTASI EKSTRAKSI WEB (WEB SCRAPING) PADA SITUS BERITA MENGGUNAKAN METODE EKSPRESI REGULER. Skripsi, 68–74.
Sahria, Y. (2020). Implementasi Teknik Web Scraping pada Jurnal SINTA Untuk Analisis Topik Penelitian Kesehatan Indonesia. URECOL (Unversity Research Colloqium), 11(2020), 297–306. Retrieved from http://repository.urecol.org/index.php/proceeding/article/view/1079
Sahria, Y., & Fudholi, D. H. (2020). Analisis Topik Penelitian Kesehatan di Indonesia Menggunakan Metode Topic Modeling LDA. Jurnal Rekayasa Sistem Dan Teknologi Informasi, 4(2), 336–344.
Sanjaya, L., & Susanti, S. (2021). Perancangan Sistem Informasi Penerimaan Siswa Baru Berbasis Web Di Smp Taruna Mandiri Cimahi. E-PROSIDING SISTEM INFORMASIVol. 2, No. 2, Desember2021, 2(2), 84–92.
Satriajati, S., Panuntun, S. B., & Pramana, S. (2021). Implementasi Web Scraping Dalam Pengumpulan Berita Kriminal Pada Masa Pandemi Covid-19. Seminar Nasional Official Statistics, 2020(1), 300–308. https://doi.org/10.34123/semnasoffstat.v2020i1.578
Sembiring, F., Yudistyral, D., Sari, D. P., Sistem Informasi, P., Pendidikan, P., Dan, S., & Informasi, T. (2020). Penerapan Teknik Scraping Python Pada Website Marketplace Indonesia. INTEGRATED (Information Tecknology and Vocational Education), 2(1), 15–21.
Sujarwadi, F., & Zailani, A. U. (2019). Prosiding Seminar Nasional Informatika PERANCANGAN SISTEM INFORMASI WEB SCRAPING RESEP MASAKAN BERBASIS PHP DESIGN WEB SCRAPING INFORMATION SYSTEM OF PHP-BASED FOOD RECIPES. Prosiding Seminar Nasional Informatika Dan Sistem Informasi, 4(1), 34–45.
Syarif, M., & Nugraha, W. (2020). Pemodelan Diagram UML Sistem Pembayaran Tunai Pada Transaksi E-Commerce. Jurnal Teknik Informatika Kaputama (JTIK), 4(1), 70 halaman. Retrieved from http://jurnal.kaputama.ac.id/index.php/JTIK/article/view/240
Zharfan, R. N., & Najiyah, I. (2022). Rancang Bangun Aplikasi Cyber Rongsok Berbasis Website Menggunakan Framework Codeigniter. E-PROSIDING SISTEM INFORMASI, 3(1).
Unduhan
File Tambahan
Diterbitkan
Terbitan
Bagian
Lisensi
License and Copyright Agreement
In submitting the manuscript to the journal, the authors certify that:
- They are authorized by their co-authors to enter into these arrangements.
- That it is not under consideration for publication elsewhere,
- That its publication has been approved by all the author(s) and by the responsible authorities – tacitly or explicitly – of the institutes where the work has been carried out.
- They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
- They agree to the following license and copyright agreement.
Copyright
Authors who publish with International Journal of Advances in Intelligent Informatics agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.Â
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.
Licensing for Data Publication
International Journal of Advances in Intelligent Informatics use a variety of waivers and licenses, that are specifically designed for and appropriate for the treatment of data:
Open Data Commons Attribution License, http://www.opendatacommons.org/licenses/by/1.0/ (default)
Creative Commons CC-Zero Waiver, http://creativecommons.org/publicdomain/zero/1.0/
Open Data Commons Public Domain Dedication and Licence, http://www.opendatacommons.org/licenses/pddl/1-0/
Other data publishing licenses may be allowed as exceptions (subject to approval by the editor on a case-by-case basis) and should be justified with a written statement from the author, which will be published with the article.
Open Data and Software Publishing and Sharing
The journal strives to maximize the replicability of the research published in it. Authors are thus required to share all data, code or protocols underlying the research reported in their articles. Exceptions are permitted but have to be justified in a written public statement accompanying the article.
Datasets and software should be deposited and permanently archived inappropriate, trusted, general, or domain-specific repositories (please consult http://service.re3data.org and/or software repositories such as GitHub, GitLab, Bioinformatics.org, or equivalent). The associated persistent identifiers (e.g. DOI, or others) of the dataset(s) must be included in the data or software resources section of the article. Reference(s) to datasets and software should also be included in the reference list of the article with DOIs (where available). Where no domain-specific data repository exists, authors should deposit their datasets in a general repository such as ZENODO, Dryad, Dataverse, or others.
Small data may also be published as data files or packages supplementary to a research article, however, the authors should prefer in all cases a deposition in data repositories.