Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
e5579d98c1
Added data
1 year ago
e5579d98c1
Added data
1 year ago
e5579d98c1
Added data
1 year ago
c4d4fd7970
Add a README
1 year ago
e5579d98c1
Added data
1 year ago
e5579d98c1
Added data
1 year ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

LAION-Aesthetics V2 (6.5+)

LAION-Aesthetics is a subset of the LAION 5B dataset with high visual quality.

The dataset in this repo is specifically the images that scored 6.5 or higher via aesthetics prediction models. These models were trained to predict the rating people gave when asked “How much do you like this image on a scale from 1 to 10?”.

The data is composed of image-caption pairs along with an aesthetics score for each image and the original URL, from which it was downloaded.

Example usage

All of the images can be found under the data/ folder.

A TSV-formatted labels file can also be found there (data/labels.tsv.

This TSV file contains 4 columns: image file name, caption, aesthetics score, url

import os

from dagshub.streaming import DagsHubFilesystem
from PIL import Image

# Setup data streaming from DagsHub
fs = DagsHubFilesystem('.', repo_url='https://dagshub.com/DagsHub-Datasets/LAION-Aesthetics-V2-6.5plus')
fs.install_hooks()

# Get all images + labels.tsv file
files = fs.listdir('data/')

# Get the data for the first 5 images in the labels.tsv file
with fs.open('data/labels.tsv') as tsv:
    for row in tsv.readlines()[:5]:
        row = row.strip()
        img_file, caption, score, url = row.split('\t')

        # Load the image file
        img_path = os.path.join('data', img_file)
        img = Image.open(img_path)
        print(f'{img_file} has a size of {img.size} and an aesthetics score of {score}')

Further information

For further information, see the LAION-Aesthetics project page. This dataset was created by LAION.

Tip!

Press p or to see the previous file or, n or to see the next file

About

A subset of the LAION Aesthetics V2 dataset that contains only images with an aesthetics score of 6.5 or larger.

Collaborators 5

Comments

Lei Shi

commented in commitc4d4fd7970on branch main

2 weeks ago

Great data! That is what I am looking for. Really appreicate your work. I think the filename has been changed in terms of the original laion-asesthetics dataset. How can I get correspoding filename for the original laion-aesthetics dataset? Thanks

Loading...