Are you sure you want to delete this access key?
title |
---|
Library of Congress |
One of our sources of book data is the Library of Congress MDSConnect Books bibliography records.
We download and import the XML versions of these files.
Imported data lives under the locmds
schema.
The import is controlled by the following DVC steps:
schemas/loc-mds-schema.dvc
loc-mds-schema.sql
to set up the base schema.import/loc-mds-books.dvc
data/loc-books/
.import/loc-mds-extract-isbns.dvc
index/loc-mds-index-books.dvc
loc-mds-index-books.sql
to index the book data and extract tables.index/loc-mds-book-info.dvc
loc-mds-book-info.sql
to extract additional book data into tables.The locmds.book_marc_fields
table contains the raw data imported from the MARC files, as MARC fields. The LOC book data follows the MARC 21 Bibliographic Data format; the various tags, field codes, and indicators are defined there. This table is not terribly useful on its own, but it is the source from which the other tables are derived.
It has the following columns:
rec_id
fld_no
fld_no
with their containing field.tag
LDR
for the MARC leader.ind1
, ind2
sf_code
contents
We then extract a number of tables and views from this MARC data. These tables include:
book_record_info
More information about the last three is in the leader specification.
book
book_record_info
intended to capture the actual books in the collection,
as opposed to other types of materials. We consider a book to be anything that has MARC
record type ‘a’ or ‘t’ (language material), and is not also classified as a government
record in MARC field 008.book_extracted_isbn
parse-isbns
parses out ISBNs, along with additional tags or
descriptors, from the ISBN strings using a number of best-effort heuristics. This table contains
the results of that process.book_rec_isbn
book_author_name
book_pub_year
book_title
Press p or to see the previous file or, n or to see the next file
Are you sure you want to delete this access key?
Are you sure you want to delete this access key?