Comparative Analysis of Vectorization Methods for Academic Supervisor Recommendations
Abstract
Selecting final project supervisors often poses challenges for students due to limited lecturer quotas and difficulties in finding suitable expertise matches. This study proposes using the Cosine Similarity method with vectorization approaches such as Bidirectional Encoder Representations from Transformers (BERT), FastText, Bag of Words (BoW), Term Frequency-Inverse Document Frequency (TF-IDF), and Word2Vec to enhance the accuracy of recommendation systems. Data sourced from Google Scholar underwent scraping, preprocessing, and vectorization to evaluate the most effective method for understanding context and recommending relevant supervisors. The analysis revealed that BERT and Word2Vec based approaches achieved superior performance, delivering a perfect hit ratio (1.00) and overcoming the limitations of TF-IDF and BoW in capturing technical language. This recommendation system is expected to streamline the supervisor selection process, minimize mismatches, and effectively support academic advisory processes across educational institutions
The Authors submitting a manuscript do so on the understanding that if accepted for publication, copyright of the article shall be assigned to Jurnal Teknologi Informasi dan Terapan (J-TIT) and Department of Information Technology, Politeknik Negeri Jember as publisher of the journal. Copyright encompasses rights to reproduce and deliver the article in all form and media, including reprints, photographs, microfilms, and any other similar reproductions, as well as translations. Authors should sign a copyright transfer agreement when they have approved the final proofs sent by Jurnal Teknologi Informasi dan Terapan (J-TIT) prior to the publication. The copyright transfer agreement can be download here .
Jurnal Teknologi Informasi dan Terapan (J-TIT) and Department of Information Technology, Politeknik Negeri Jember and the Editors make every effort to ensure that no wrong or misleading data, opinions or statements be published in the journal. In any way, the contents of the articles and advertisements published in Jurnal Teknologi Informasi dan Terapan (J-TIT) are the sole responsibility of their respective authors and advertisers.
Users of this website will be licensed to use materials from this website following the Creative Commons Attribution 4.0 International License. No fees charged. Please use the materials accordingly.

This work is licensed under a Creative Commons Attribution-Share A like 4.0 International License
You are free to:
- Share — copy and redistribute the material in any medium or format
- Adapt — remix, transform, and build upon the material for any purpose, even commercially.
- The licensor cannot revoke these freedoms as long as you follow the license terms.





