Luca Engel

Recommender System for Books

Dec 13, 2024

Recommender System for Books

TL;DR

At a glance

Problem

Recommender systems must predict user preferences despite sparse data: most users rate only one or two items, and new books suffer from cold-start issues. Traditional collaborative filtering (CF) struggles in such settings. The challenge: can we build a scalable recommender that performs well under sparsity, while still personalizing recommendations?

Solution overview

We implemented and compared multiple CF models:

  1. ALS matrix factorization — baseline latent factor model.
  2. User-based kNN CF — leverages user similarity for predictions.
  3. Item-based kNN CF with metadata — enriches similarity with book metadata (subjects, summaries, language).
  4. Hybrid CF — averages predictions from user- and item-based CF to balance their strengths.

Architecture

Data

Method

Experiments & Results

Benchmarks

A value of k = 5 was chosen for kNN models based on validation RMSEs.

Model VariantRMSE (test)
Hybrid CF + metadata0.8242
User-based CF0.8252
Item-based CF0.8260
Item-based CF + metadata0.8256
ALS1.1318

Evaluation protocol.

Analysis


Impact

What I learned

Future Work

References