Skip to content

Latest commit

 

History

History
147 lines (122 loc) · 9.58 KB

README.md

File metadata and controls

147 lines (122 loc) · 9.58 KB

kNN Bandit

This repository contains the code needed to reproduce the experiments of the paper:

J. Sanz-Cruzado, P. Castells, E. López. A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendation. 13th ACM Conference on Recommender Systems (RecSys 2019). Copenhagen, Denmark, September 2019.

Authors

Information Retrieval Group at Universidad Autónoma de Madrid

Software description

This repository contains all the needed classes to reproduce the experiments explained in the paper. The software contains the following packages:

  • es.uam.eps.ir.knnbandit.data: Classes for handling the ratings by users for items. Extension of the RankSys preference data classes to allow the addition of new users, items and ratings.
  • es.uam.eps.ir.knnbandit.graph: Classes for handling graph data for contact recommendation.
  • es.uam.eps.ir.knnbandit.grid: Classes for reading the list of algorithms to execute.
  • es.uam.eps.ir.knnbandit.metrics: Classes implementing the cumulative metrics used in the experiments.
  • es.uam.eps.ir.knnbandit.recommendation: Implementation of recommendation algorithms and similarities.
  • es.uam.eps.ir.knnbandit.stats: Probability distributions.
  • es.uam.eps.ir.knnbandit.utils: Additional classes, useful for the rest of the program.

Algorithms

The software includes the implementation of several recommendation algorithms.

Multi-armed bandits for recommendation

  • kNN bandit: The main contribution of this paper: we implement our proposed approach by defining a user-based kNN recommender with the appropriate item scoring function, to be used with a stochastic similarity that uses Thompson sampling to estimate the similarities between users.
  • Item-oriented, non-personalized multi-armed bandits: ε-greedy, ε t-greedy, UCB1, UCB1-tuned, Thompson sampling. They are used as baseline bandit algorithms in the paper.

Myopic recommendation algorithms

These approaches are just an incrementally-updateable version of classical recommendation algorithms, used as baselines. The algorithms included in this comparison are:

  • Non-personalized recommendation: Random recommendation, popularity-based recommendation, average rating.
  • Matrix factorization: Implicit matrix factorization (iMF) [1], fast iMF [2], pLSA [3].
  • User-based kNN: Non-normalized implementations of classic user-based cosine kNN.

Metrics

To evaluate and analyze the different algorithms, we implement two metrics:

  • Cumulative Recall: The proportion of relevant ratings that have been discovered at a certain point in time.
  • Cumulative Gini: Measures how imbalanced is the distribution of the number of times each item has been recommended up to some point in time.

System Requirements

Java JDK: 1.8 or above (the software was tested using version 1.8.0_112).

Maven: tested with version 3.6.0.

Installation

In order to install this program, you need to have Maven (https://maven.apache.org) installed on your system. Then, download the files into a directory, and execute the following command:

mvn compile assembly::single

If you do not want to use Maven, it is still possible to compile the code using any Java compiler. In that case, you will need the following libraries:

Execution

Once you have generated a .jar, you can execute the program. There are two different ways to run this program: one for general recommendation (movies, songs, venues...) and one for contact recommendation in social networks, since the respective evaluation protocols have slight differences between them.

General recommendation

java -jar knnbandit-jar-with-dependencies.jar generalrec algorithmsFile dataFile outputFolder numIter threshold resume binarize

where the command line arguments are:

  • algorithmsFile: A file indicating which algorithms have to be executed.
  • dataFile: The rating data, including one rating per line with the format: user \t item \t rating.
  • outputFolder: The directory where the output files will be stored.
  • numIter: The number of iterations to run for each algorithm. Use value 0 for running until no new items can be recommended.
  • threshold: Relevance threshold. Ratings greater than or equal to this value will be considered as relevant.
  • resume: Set value to true to resume execution following up from the output of a previous execution (if any) or false to overwrite and start the interactive recommendation cycle from scratch.
  • useRatings: Set value to false for using binarized rating values (1 for relevant, 0 for non-relevant), true to leave rating values as are.

For reproducing the exact experiments of the paper, program argument values are:

  • numIter = 500000 for Foursquare-NY, numIter = 1000000 for Foursquare-Tokyo and numIter = 3000000 for MovieLens1M.
  • threshold = 1 for Foursquare and threshold = 4 for MovieLens1M.
  • useRatings = false for all datasets.

Contact recommendation

java -jar knnbandit-jar-with-dependencies.jar contactRec algorithmsFile dataFile outputFolder numIter directed resume notReciprocal

where

  • algorithmsFile: A file indicating which algorithms have to be executed
  • dataFile: The graph data, including one edge per line with the format: originUser \t destUser \t weight.
  • outputFolder: The directory where the output files will be stored.
  • numIter: The number of iterations to run for each algorithm. Use value 0 for running until no new items can be recommended.
  • directed: Set value to true if the social network is directed, false otherwise.
  • resume: Set value to true to resume execution following up from the output of a previous execution (if any) or false to overwrite and start the interactive recommendation cycle from scratch.
  • notReciprocal: Set value to true if the algorithms should not recommend reciprocal links, false otherwise.

For reproducing the exact experiments of the paper, the arguments are:

  • numIter = 5000000.
  • directed = true.
  • notReciprocal = true.

Algorithm files

In order to execute different configurations, we include in the config folder the optimal configurations for the different datasets we used in the paper. Each row represents the configuration for a single algorithm.

Example:

popularity
random
average
ubknn-100
knnbandit-1-1-10
mf-10-fastimf-10-10-20-true
itembandit-epsilon-0.2-stationary
itembandit-thompson-1-100

In the above configuration file example, we choose different algorithms ot be run (each with specific parameter settings): popularity, random recommendation, average rating, myopic user-based kNN, kNN bandit, matrix factorization, ε-greedy and Thompson Sampling.

Testing different configurations for the kNN bandit

To test the different settings for this algorithm, the format of the line to add in the configuration file is:

knnbandit-k-alpha-beta

where:

  • k is the number of neighbors to use.
  • alpha is the initial number of hits of the algorithm (high value for optimistic start).
  • beta is the initial number of errors of the algorithm (high value for pessimistic start).

Random seed

It is possible to set a random seed for the experiments, so that the selection of users and other random choices are the same when the experiment is repeated. For that purpose, in the output directory, just add a file named rngseed (without any file extension) containing the seed in the first line, and set the parameter resume to true. We include the random seeds we used in our experiments in the rng-seeds folder. In order to use them, they have to be renamed when added to the output directory.

Output format

The output of both programs is the same: for each algorithm in the comparison, a file will be created. The name of the file will be the same as the chosen algorithm configuration. Each of the output files has the following format: separated by tabs, the first line contains the header of the file. Then, each row contains the information of a single iteration: the number of the iteration, the selected user, the selected item, the value of the metrics and the time taken to execute the iteration (in ms.)

This is an example of the content format of this file:

iter	user	item	recall	gini	time
0	1713	4901	0.0	1.0	27
1	1880	1477	0.0	0.9999838334195551	13
2	1626	56725	0.0	0.9999676668391102	3
3	2002	34539	0.0	0.9999515002586653	3
4	477	5085	0.0	0.9999353336782204	6
5	2012	44312	0.0	0.9999191670977755	5
6	1526	60448	0.0	0.9999030005173306	45
7	528	9392	0.0	0.9998868339368857	46
8	887	2878	0.0	0.9998706673564408	31
9	1313	22947	0.0	0.9998545007759959	56
10	2274	45478	0.0	0.9998383341955509	1
11	1615	7493	0.0	0.999822167615106	2
12	1481	58528	0.0	0.9998060010346611	0

References

  1. Hofmann, T. (2004). Latent semantic models for collaborative filtering. ACM Transactions on Information Systems, 22(1), pp. 89–115
  2. Hu, Y., Koren, Y., & Volinsky, C. (2008). Collaborative Filtering for Implicit Feedback Datasets. In 2008 Eighth IEEE International Conference on Data Mining (ICDM 2008). Pisa, Italy, December 2008, pp. 263–272.
  3. Pilászy, I., Zibriczky, D., & Tikk, D. (2010). Fast ALS-based matrix factorization for explicit and implicit feedback datasets. In Proceedings of the 4th ACM conference on Recommender systems (Recsys 2010). Barcelona, Spain, September 2010, pp. 71-78.