Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
a39491f4de
Initial commit
1 year ago
04ad222cde
update readme automation
1 year ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

MultiCoNER Dataset

Stream data with DDA:

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/multiconer-dataset")

fs.listdir("s3://multiconer")

Description:

MultiCoNER is a large multilingual dataset (11 languages) for Named Entity Recognition. It is designed to represent some of the contemporary challenges in NER, including low-context scenarios (short and uncased text), syntactically complex entities such as movie titles, and long-tail entity distributions.

Contact:

MultiCoNER is a large multilingual dataset (11 languages) for Named Entity Recognition. It is designed to represent some of the contemporary challenges in NER, including low-context scenarios (short and uncased text), syntactically complex entities such as movie titles, and long-tail entity distributions.

Managed By:

https://www.amazon.com/

Resources:

  1. resource:
    • Description: Data files
    • ARN: arn:aws:s3:::multiconer
    • Region: us-west-2
    • Type: S3 Bucket

Tags:

natural language processing

Publication:

  1. publication:

  2. publication:

  3. publication:

Tip!

Press p or to see the previous file or, n or to see the next file

About

multiconer-dataset is originate from the Registry of Open Data on AWS

Collaborators 5

Comments

Loading...