Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
03aa72b0f6
Initial commit
1 year ago
6cece0e8e9
update readme automation
1 year ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

NLP - fast.ai datasets

Stream data with DDA:

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/fast-ai-nlp-dataset")

fs.listdir("s3://fast-ai-nlp")

Description:

Some of the most important datasets for NLP, with a focus on classification, including IMDb, AG-News, Amazon Reviews (polarity and full), Yelp Reviews (polarity and full), Dbpedia, Sogou News (Pinyin), Yahoo Answers, Wikitext 2 and Wikitext 103, and ACL-2010 French-English 10^9 corpus. This is part of the fast.ai datasets collection hosted by AWS for convenience of fast.ai students. See documentation link for citation and license details for each dataset.

Contact:

Some of the most important datasets for NLP, with a focus on classification, including IMDb, AG-News, Amazon Reviews (polarity and full), Yelp Reviews (polarity and full), Dbpedia, Sogou News (Pinyin), Yahoo Answers, Wikitext 2 and Wikitext 103, and ACL-2010 French-English 10^9 corpus. This is part of the fast.ai datasets collection hosted by AWS for convenience of fast.ai students. See documentation link for citation and license details for each dataset.

Update Frequency:

As required

Managed By:

http://www.fast.ai/

Resources:

  1. resource:
    • Description: Datasets
    • ARN: arn:aws:s3:::fast-ai-nlp
    • Region: us-east-1
    • Type: S3 Bucket

Tags:

aws-pds, deep learning, natural language processing, machine learning

Tip!

Press p or to see the previous file or, n or to see the next file

About

fast-ai-nlp-dataset is originate from the Registry of Open Data on AWS

Collaborators 5

Comments

Loading...