Are you sure you want to delete this access key?
title | parent | nav_order |
---|---|---|
BookCrossing | Data Model | 5 |
The BookCrossing data set consists of user-provided ratings — both implicit and explicit — of books.
If you use this data, cite:
Cai-Nicolas Ziegler, Sean M. McNee, Joseph A. Konstan, and Georg Lausen. 2005. Improving Recommendation Lists Through Topic Diversification. Proceedings of the 14th International World Wide Web Conference (WWW '05), May 10-14, 2005, Chiba, Japan.
Imported data lives in the bx
schema. The source data files are automatically downloaded and unpacked by
the provided scripts and DVC stages.
The import is controlled by the following DVC steps:
data/BX.dvc
data/BX-CSV-Dump.zip.dvc
schemas/bx-schema.dvc
bx-schema.sql
to set up the base schema.import/bx-ratings.dvc
data/BX-Book-Ratings.csv
.index/bx-index.dvc
bx-index.sql
to index the rating data and integrate with book data.The raw rating data, with invalid characters cleaned up, is in the bx.raw_ratings
table, with
the following columns:
We extract the following tables for BookCrossing ratings:
rating
rating > 0
) from the raw ratings table.add_action
Both of these tables are pre-clustered, so the book IDs refer to book clusters and not individual ISBNs or editions. They have the following columns:
user_id
book_id
rating
rating
table.nactions
Press p or to see the previous file or, n or to see the next file
Are you sure you want to delete this access key?
Are you sure you want to delete this access key?