Are you sure you want to delete this access key?
title | parent | nav_order |
---|---|---|
GoodReads | Data Model | 7 |
{: .no_toc}
We import GoodReads data from the UCSD Book Graph for additional book and user interaction information. The source files are not automatically downloaded; you will need the following:
We do not yet support reviews.
If you use this data, cite the paper(s) documented on the data set web site.
Imported data lives in the gr
schema.
The import is controlled by the following DVC steps:
schemas/gr-schema.dvc
gr-schema.sql
to set up the base schema.import/gr-*.dvc
data/
index/gr-index-books.dvc
gr-index-books.sql
to index the book data and extract identifiers.index/gr-book-info.dvc
gr-book-info.sql
to extract additional book and work metadata.index/gr-index-ratings.dvc
gr-index-ratings.sql
to index the rating and interaction data.The raw rating data, with invalid characters cleaned up, is in the various gr.raw_*
tables:
raw_book
raw_work
raw_author
raw_series
raw_book_genres
raw_interaction
Each table has the following columns:
JSONB
column containing imported data.We extract the following tables for book and work data:
work_ids
book_ids
gr_isbn
)gr_isbn13
)This table extracts the textual versions of ISBNs and ASINs directly from the raw_book
table. It does not resolve them to ISBN IDs.
book_isbn
book_genres
work_title
book_pub_date
pub_date
contains the date as an SQL date. These are the publication_*
fields in the book JSON data.work_pub_date
book_pub_date
, but from a work's original_publication_*
fields.book_cluster
We extract the following tables for book ratings and interactions (add-to-shelf actions):
user_info
interaction
gr_interaction_rid
gr_book_id
gr_user_rid
rating
is_read
isRead
flag from original JSON data.date_added
date_updated
rating
add_action
Press p or to see the previous file or, n or to see the next file
Are you sure you want to delete this access key?
Are you sure you want to delete this access key?
Are you sure you want to delete this access key?
Are you sure you want to delete this access key?