Photo by Milad Fakurian on Unsplash

LEGO-Spoken-Dialogue-Corpus Dataset for Machine Learning

Install DagsHub:

pip install dagshub
Click on copy button to copy content

To stream this data directly on DagsHub

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/LEGO-Spoken-Dialogue-Corpus")

fs.listdir("corpus")
Click on copy button to copy content

Description

The LEGOv2 database is a parameterized and annotated version of the CMU Let’s Go database from 2006 and 2007. This spoken dialogue corpus contains interactions captured from the CMU Let’s Go (LG) System by Carnegie Mellon University in 2006 and 2007. It is based on raw log-files from the LG system. The corpus has been parameterized and annotated by the Dialogue Systems Group at Ulm University, Germany.

Additional information

Related datasets

CREMA-D

daps-dataset

UrbanSounds

UrbanSound8K-Labeled Urban Sound Excerpts Dataset

Launch your ML development to new heights with DagsHub

Back to top