Comparative Analysis of Collaborative Filtering Models on Highly Sparse Book Datasets

Authors

  • Mickael Sitompul Institut Teknologi Del, Faculty of Informatics and Electrical Engineering, Sitoluama, Laguboti, 22381, INDONESIA
  • Jeremy Nainggolan Institut Teknologi Del, Faculty of Informatics and Electrical Engineering, Sitoluama, Laguboti, 22381, INDONESIA
  • Reinaldy Hutapea Institut Teknologi Del, Faculty of Informatics and Electrical Engineering, Sitoluama, Laguboti, 22381, INDONESIA
  • Tennov Pakpahan Institut Teknologi Del, Faculty of Informatics and Electrical Engineering, Sitoluama, Laguboti, 22381, INDONESIA
  • Ioka Purba Institut Teknologi Del, Faculty of Informatics and Electrical Engineering, Sitoluama, Laguboti, 22381, INDONESIA

DOI:

https://doi.org/10.54074/jicsa.v1i02.19

Keywords:

Collaborative Filtering, Alternating Least Squares, Matrix Factorization, Data Sparsity, Implicit Feedback, Book Recommendation, MAP@10

Abstract

The rapid growth of digital information necessitates robust recommender systems to filter relevant content effectively. This study addresses the challenge of generating personalized book recommendations using a highly sparse dataset (>99% sparsity) derived from the Recommender System 2023 Challenge. The primary objective is to predict user interest based on implicit feedback and optimize the ranking quality of Top-N recommendations. We conduct a comparative analysis between a memory-based baseline, Item-Based Collaborative Filtering (IBCF), and a model-based approach, Alternating Least Squares (ALS). The methodology incorporates k-core decomposition for data preprocessing to mitigate sparsity issues, followed by rigorous hyperparameter optimization via grid search to tune latent factors, regularization parameters, and confidence weights. Experimental results demonstrate that the baseline IBCF model struggles significantly with data sparsity, yielding a Mean Average Precision at 10 (MAP@10) of only 0.0018. In contrast, the hyperparametertuned ALS model achieves a MAP@10 of 0.0348, representing a 19.3- fold improvement over the baseline and a 3.3-fold improvement
over the default ALS configuration. These findings confirm that
matrix factorization techniques, when systematically optimized,
significantly outperform memory-based methods in handling sparse
implicit feedback, providing a scalable and accurate solution for
ranking tasks.

References

[1] Abdalla, H. I., Amer, A. A., Amer, Y. A., Nguyen, L., & Al-Maqaleh, B. (2023). Boosting the Item-Based Collaborative Filtering Model with Novel Similarity Measures. International Journal of Computational Intelligence Systems, 16(1), 1–16. https://doi.org/10.1007/s44196-023-00299-2

[2] Awan, M. J., Khan, R. A., Nobanee, H., Yasin, A., Anwar, S. M., Naseem, U., & Singh, V. P. (2021). A recommendation engine for predicting movie ratings using a big data approach. Electronics (Switzerland), 10(10). https://doi.org/10.3390/electronics10101215

[3] Bobadilla, J., Hernando, A., Ortega, F., & Bernal, J. (2011). A framework for collaborative filtering recommender systems. Expert Systems with Applications, 38(12), 14609–14623. https://doi.org/10.1016/j.eswa.2011.05.021

[4] Da’u, A., & Salim, N. (2020). Recommendation system based on deep learning methods: a systematic review and new directions. In Artificial Intelligence Review (Vol. 53, Number 4). Springer Netherlands. https://doi.org/10.1007/s10462-019-09744-1

[5] Elahi, M., Ricci, F., & Rubens, N. (2016). A survey of active learning in collaborative filtering recommender systems. Computer Science Review, 20, 29–50. https://doi.org/10.1016/j.cosrev.2016.05.002

[6] Gupta, S. (2020). A Literature Review on Recommendation Systems sept 2020. Int. Res. J. Eng. Technol, 3600–3605. https://www.academia.edu/download/64789612/IRJET_V7I9633.pdf

[7] Hartatik, H., Nurhayati, S. D., & Widayani, W. (2021). Sistem Rekomendasi Wisata Kuliner di Yogyakarta dengan Metode Item-Based Collaborative Filtering. Journal Automation Computer Information System, 1(2), 55–63. https://doi.org/10.47134/jacis.v1i2.8

[8] Herlocker, J. L., Konstan, J. A., Terveen, L. G., & Riedl, J. T. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22(1), 5–53. https://doi.org/10.1145/963770.963772

[9] Iwendi, C., Ibeke, E., Eggoni, H., Velagala, S., & Srivastava, G. (2022). Pointer-Based Item-to-Item Collaborative Filtering Recommendation System Using a Machine Learning Model. International Journal of Information Technology and Decision Making, 21(1), 463–484. https://doi.org/10.1142/S0219622021500619

[10] Papadakis, H., Papagrigoriou, A., Panagiotakis, C., Kosmas, E., & Fragopoulou, P. (2022). Collaborative filtering recommender systems taxonomy. Knowledge and Information Systems, 64(1), 35–74. https://doi.org/10.1007/s10115-021-01628-7

[11] Patel, B., Desai, P., & Panchal, U. (2017). Methods of recommender system: A review. Proceedings of 2017 International Conference on Innovations in Information, Embedded and Communication Systems, ICIIECS 2017, 2018-Janua, 1–4. https://doi.org/10.1109/ICIIECS.2017.8275856

[12] Patel, K., & Patel, H. B. (2020). A state-of-the-art survey on recommendation system and prospective extensions. Computers and Electronics in Agriculture, 178(June), 105779. https://doi.org/10.1016/j.compag.2020.105779

[13] Rendle, S., Krichene, W., Zhang, L., & Koren, Y. (2022). Revisiting the Performance of iALS on Item Recommendation Benchmarks. RecSys 2022 - Proceedings of the 16th ACM Conference on Recommender Systems, 427–435. https://doi.org/10.1145/3523227.3548486

Downloads

Published

2026-05-06

Issue

Section

Articles