Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
General:  open-data-registry Type:  dataset Integration:  git aws s3
24c0075867
Initial commit
1 year ago
0c8244a1b3
update readme automation
1 year ago
Storage Buckets

README.md

You have to be logged in to leave a comment. Sign In

WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation

Stream data with DDA:

from dagshub.streaming import DagsHubFilesystem

fs = DagsHubFilesystem(".", repo_url="https://dagshub.com/DagsHub-Datasets/wikisum-dataset")

fs.listdir("s3://wikisum")

Description:

This dataset provides how-to articles from wikihow.com and their summaries, written as a coherent paragraph. The dataset itself is available at wikisum.zip, and contains the article, the summary, the wikihow url, and an official fold (train, val, or test). In addition, human evaluation results are available at wikisum-human-eval.zip. It consists of human evaluation of the summary of the Pegasus system, annotators response regarding the difficulty of the task, and words they marked as unknown.

Contact:

This dataset provides how-to articles from wikihow.com and their summaries, written as a coherent paragraph. The dataset itself is available at wikisum.zip, and contains the article, the summary, the wikihow url, and an official fold (train, val, or test). In addition, human evaluation results are available at wikisum-human-eval.zip. It consists of human evaluation of the summary of the Pegasus system, annotators response regarding the difficulty of the task, and words they marked as unknown.

Update Frequency:

Not currently being updated

Managed By:

https://www.amazon.com/

Resources:

  1. resource:

Tags:

amazon.science, natural language processing, machine learning

Publication:

  1. publication:
    • Title: WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation
    • URL: https://2021.aclweb.org/
    • AuthorName: Nachshon Cohen, Oren Kalinsky, Yftah Ziser & Alessandro Moschitti
Tip!

Press p or to see the previous file or, n or to see the next file

About

wikisum-dataset is originate from the Registry of Open Data on AWS

Collaborators 5

Comments

Loading...