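PENGGUNAAN METODE COSINE SIMILARITY DAN TF-IDF UNTUK KLASIFIKASI JUDUL SEMINAR PROPOSAL PADA FAKULTAS TEKNIK UNIVERSITAS JABAL GHAFUR [Using the Cosine Similarity and TF-IDF Methods to Classify Seminar Proposal Titles at the Faculty of Engineering, Universitas Jabal Ghafur]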

Authors

  • Nilawati (Teknik Informatika, Universitas Jabal Ghafur)
  • Husaini (Teknik Informatika, Universitas Jabal Ghafur, Sigli)
  • Junaidi Salat (Teknik Informatika, Universitas Jabal Ghafur, Sigli)

DOI:

https://doi.org/10.61579/sagita.v2i1.60

Keywords:

Cosine Similarity, classification of titles, TF-IDF, Website, Seminar

Abstract

The title classification system aims to help users group student seminar title documents into categories. The volume of student research title documents calls for an application that can group them according to their respective categories. The purpose of this study is to build a seminar title classification system that works on the combined text of each seminar's title and description. The TF-IDF method weights each word (term) by its frequency of occurrence in that combined text, and the cosine similarity algorithm serves as the comparison method that measures how similar two documents are. The research applies text mining with cosine similarity and TF-IDF weighting so that the data can be expected to be classified automatically, quickly, and accurately. The training data is dummy data comprising 100 previously held seminars spread across several categories from different fields of science. The system is implemented in the PHP programming language, using existing PHP text mining resources for the text preprocessing stage, followed by the implementation of the TF-IDF method and the cosine similarity algorithm.
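The pipeline described in the abstract can be illustrated with a minimal sketch. It is written in Python rather than the PHP used in the study, and it is not the authors' implementation. Several choices are assumptions made only for illustration: the preprocessing is reduced to lowercasing and splitting on non-letters, the weighting uses raw term counts multiplied by idf(t) = log(N / df(t)), a new title receives the category of its single most similar training document, and the six training titles and their category names are hypothetical, not taken from the study's 100-seminar data set.

```python
import math
import re
from collections import Counter

# Hypothetical training titles and categories, invented for this sketch.
TRAINING = [
    ("web based library information system", "Information Systems"),
    ("inventory information system for a retail store", "Information Systems"),
    ("image classification with a convolutional neural network", "Artificial Intelligence"),
    ("sentiment classification using a neural network", "Artificial Intelligence"),
    ("wireless sensor network for flood monitoring", "Networking"),
    ("campus network security monitoring", "Networking"),
]


def tokenize(text):
    """Lowercase and split on non-letters (a stand-in for real preprocessing)."""
    return re.findall(r"[a-z]+", text.lower())


def build_idf(tokenized_docs):
    """Compute idf(t) = log(N / df(t)) over the training documents."""
    n_docs = len(tokenized_docs)
    doc_freq = Counter()
    for tokens in tokenized_docs:
        doc_freq.update(set(tokens))  # count each term once per document
    return {term: math.log(n_docs / df) for term, df in doc_freq.items()}


def tfidf_vector(tokens, idf):
    """Weight raw term counts by idf; terms unseen in training are dropped."""
    counts = Counter(tokens)
    return {t: c * idf[t] for t, c in counts.items() if t in idf}


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def classify(query, training):
    """Return (label, score) of the training document most similar to query."""
    tokenized = [tokenize(text) for text, _ in training]
    idf = build_idf(tokenized)
    query_vec = tfidf_vector(tokenize(query), idf)
    best_label, best_score = None, -1.0
    for tokens, (_, label) in zip(tokenized, training):
        score = cosine_similarity(query_vec, tfidf_vector(tokens, idf))
        if score > best_score:
            best_label, best_score = label, score
    return best_label, best_score


label, score = classify("neural network for image recognition", TRAINING)
print(label)  # Artificial Intelligence
```

With this weighting, a term that appears in every training document gets an idf of zero and stops influencing the similarity, which is why rare, topic-specific words such as "image" dominate the match in the example.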

Published

2024-01-31

How to Cite

Nilawati, Husaini, & Salat, J. (2024). PENGGUNAAN METODE COSINE SIMILARITY DAN TF-IDF UNTUK KLASIFIKASI JUDUL SEMINAR PROPOSAL PADA FAKULTAS TEKNIK UNIVERSITAS JABAL GHAFUR. Sagita Academia Journal, 2(1), 72–79. https://doi.org/10.61579/sagita.v2i1.60