Install DagsHub:
pip install dagshub
To stream this data directly on DagsHub
from dagshub.streaming import DagsHubFilesystem
fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/comonscreens-dataset")
fs.listdir("s3://common-screens")
Description
A corpus of web screenshot and metadata data composed of over 70 million websites.
Additional information
Documentation
Update frequency
Monthly