Photo by Shubham Dhage on Unsplash

LAION-Aesthetics V2 (6.5+) Dataset for Machine Learning

Install DagsHub:

pip install dagshub
Click on copy button to copy content

To stream this data directly on DagsHub

import os

from dagshub.streaming import DagsHubFilesystem
from PIL import Image

# Setup data streaming from DagsHub
fs = DagsHubFilesystem('.', repo_url='https://dagshub.com/DagsHub-Datasets/LAION-Aesthetics-V2-6.5plus')
fs.install_hooks()

# Get all images + labels.tsv file
files = fs.listdir('data/')

# Get the data for the first 5 images in the labels.tsv file
with fs.open('data/labels.tsv') as tsv:
    for row in tsv.readlines()[:5]:
        row = row.strip()
        img_file, caption, score, url = row.split('\t')

        # Load the image file
        img_path = os.path.join('data', img_file)
        img = Image.open(img_path)
        print(f'{img_file} has a size of {img.size} and an aesthetics score of {score}')
Click on copy button to copy content

Description

LAION-Aesthetics is a subset of the LAION 5B dataset with high visual quality. This dataset is specifically the images that scored 6.5 or higher via aesthetics prediction models. These models were trained to predict the rating people gave when asked “How much do you like this image on a scale from 1 to 10?”.

Additional information

Managed by

Related datasets

BodyM Dataset

Cloud to Street – Microsoft Flood and Clouds Dataset

A2D2: Audi Autonomous Driving Dataset

Galaxy Evolution Explorer Satellite (GALEX)

Launch your ML development to new heights with DagsHub

Back to top