Photo by Shubham Dhage on Unsplash

Multimedia Commons Dataset for Machine Learning

Install DagsHub:

pip install dagshub
Click on copy button to copy content

To stream this data directly on DagsHub

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/multimedia-commons-dataset")

fs.listdir("s3://multimedia-commons")
Click on copy button to copy content

Description

The Multimedia Commons is a collection of audio and visual features computed for the nearly 100 million Creative Commons-licensed Flickr images and videos in the YFCC100M dataset from Yahoo! Labs, along with ground-truth annotations for selected subsets. The International Computer Science Institute (ICSI) and Lawrence Livermore National Laboratory are producing and distributing a core set of derived feature sets and annotations as part of an effort to enable large-scale video search capabilities. They have released this feature corpus into the public domain, under Creative Commons License 0, so it is free for anyone to use for any purpose.

Additional information

Update frequency

Not updated.

License

The International Computer Science Institute and Lawrence Livermore National Laboratory have released the feature corpus and annotations under Creative Commons 0 (public domain), so there are no restrictions on use. More information on licensing and citation of the original metadata and the underlying images and videos is available from Yahoo! Labs at https://webscope.sandbox.yahoo.com/catalog.php?datatype=i&did=67

Related datasets

BodyM Dataset

Cloud to Street – Microsoft Flood and Clouds Dataset

A2D2: Audi Autonomous Driving Dataset

Galaxy Evolution Explorer Satellite (GALEX)

Launch your ML development to new heights with DagsHub

Back to top