IMPLEMENTATION OF WEB SCRAPING ON THE SINTA JOURNAL SITE USING THE SELENIUM WEBDRIVER FRAMEWORK IN PYTHON

Amanny Ulfah; Ina Najiah

doi:10.31000/jika.v7i1.7037

Authors

Amanny Ulfah, Universitas Adhirajasa Reswara Sanjaya
Ina Najiah, Universitas Adhirajasa Reswara Sanjaya

Abstract

Searching for scientific article references is the first stage in designing a research study. Researchers often begin by looking for references whose topic and aims match their own. Scientific articles are easy to find on the internet. However, because so many articles exist, the search takes a considerable amount of time, partly because some articles have been deleted or require additional access rights to open. To help students and researchers obtain articles relevant to their research topic more quickly, a dedicated web page that gathers information about scientific articles is needed. This study aims to build a web page containing reference information on scientific articles that helps students of Sekolah Tinggi Analis Bakti Asih and other researchers speed up their search for references. The system uses web scraping to retrieve scientific article data, implemented in Python with the selenium, pandas, BeautifulSoup, and openpyxl libraries. The scraped data, stored as an Excel file, is presented on a web page within the siakad (academic information system) website of Sekolah Tinggi Analis Bakti Asih Bandung, which is built with the CodeIgniter framework in PHP. The system was developed using the SDLC (Waterfall) method and tested using black-box testing.
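
The parse-and-export step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code. To stay self-contained it swaps in Python's standard `html.parser` and `csv` modules in place of BeautifulSoup and pandas/openpyxl, and it reads a hard-coded HTML fragment instead of a page loaded through Selenium. The markup, the `article-title` class, the `example.org` URLs, and the names `ArticleLinkParser`, `extract_articles`, and `rows_to_csv` are all assumptions made for illustration; they do not reflect the real structure of the SINTA site.

```python
import csv
import io
from html.parser import HTMLParser

# Hypothetical markup for illustration only; NOT the real SINTA page structure.
SAMPLE_HTML = """
<div class="article"><a class="article-title" href="https://example.org/a1">Web Scraping for News</a></div>
<div class="article"><a class="article-title" href="https://example.org/a2">Topic Modeling with LDA</a></div>
"""


class ArticleLinkParser(HTMLParser):
    """Collect title/url pairs from <a class="article-title"> elements."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._href = None  # href of the link currently being read, if any
        self._text = []    # text fragments seen inside that link

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if tag == "a" and "article-title" in classes:
            self._href = attrs.get("href") or ""
            self._text = []

    def handle_data(self, data):
        # Only keep text that appears inside a matching link.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            title = "".join(self._text).strip()
            self.rows.append({"title": title, "url": self._href})
            self._href = None


def extract_articles(html):
    """Parse an HTML string and return a list of {'title', 'url'} dicts."""
    parser = ArticleLinkParser()
    parser.feed(html)
    parser.close()
    return parser.rows


def rows_to_csv(rows):
    """Serialize the rows as CSV text (a stand-in for the Excel export)."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=["title", "url"])
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


articles = extract_articles(SAMPLE_HTML)
csv_text = rows_to_csv(articles)
```

In a pipeline like the one the abstract describes, the HTML string would presumably come from a browser session driven by Selenium, and the rows would be written to an `.xlsx` file instead of CSV text.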

Published

2023-02-16

Issue

Vol 7 No 1 (2023): JIKA (Jurnal Informatika)

Section

Articles

License

License and Copyright Agreement

In submitting the manuscript to the journal, the authors certify that:

They are authorized by their co-authors to enter into these arrangements.
That it is not under consideration for publication elsewhere,
That its publication has been approved by all the author(s) and by the responsible authorities, tacitly or explicitly, of the institutes where the work has been carried out.
They secure the right to reproduce any material that has already been published or copyrighted elsewhere.
They agree to the following license and copyright agreement.

Copyright

Authors who publish with International Journal of Advances in Intelligent Informatics agree to the following terms:

Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this journal.
Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial publication in this journal.
Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work.

Licensing for Data Publication

International Journal of Advances in Intelligent Informatics uses a variety of waivers and licenses that are specifically designed for, and appropriate to, the treatment of data:

Open Data Commons Attribution License, http://www.opendatacommons.org/licenses/by/1.0/ (default)
Creative Commons CC-Zero Waiver, http://creativecommons.org/publicdomain/zero/1.0/
Open Data Commons Public Domain Dedication and Licence, http://www.opendatacommons.org/licenses/pddl/1-0/

Other data publishing licenses may be allowed as exceptions (subject to approval by the editor on a case-by-case basis) and should be justified with a written statement from the author, which will be published with the article.

Open Data and Software Publishing and Sharing

The journal strives to maximize the replicability of the research published in it. Authors are thus required to share all data, code or protocols underlying the research reported in their articles. Exceptions are permitted but have to be justified in a written public statement accompanying the article.

Datasets and software should be deposited and permanently archived in appropriate, trusted, general, or domain-specific repositories (please consult http://service.re3data.org and/or software repositories such as GitHub, GitLab, Bioinformatics.org, or equivalent). The associated persistent identifiers (e.g. DOI, or others) of the dataset(s) must be included in the data or software resources section of the article. Reference(s) to datasets and software should also be included in the reference list of the article with DOIs (where available). Where no domain-specific data repository exists, authors should deposit their datasets in a general repository such as ZENODO, Dryad, Dataverse, or others.

Small data may also be published as data files or packages supplementary to a research article; however, authors should in all cases prefer deposition in data repositories.
