Register
Login
Resources
Docs Blog Datasets Glossary Case Studies Tutorials & Webinars
Product
Data Engine LLMs Platform Enterprise
Pricing Explore
Connect to our Discord channel
Type:  dataset Data Domain:  nlp Integration:  dvc git github
c7c0314cfc
added dvc remote
1 year ago
6d8e83cebb
Update readme
7 years ago
b2c0046463
minor bug fixes
7 years ago
b3f763c4fc
added dataset
1 year ago
b3f763c4fc
added dataset
1 year ago
34a26ae40f
add license
4 years ago
92ab560e75
remove links
5 years ago
b3f763c4fc
added dataset
1 year ago
7c5f4796b1
Add requirements; update readme
7 years ago
Storage Buckets
Data Pipeline
Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

README.md

You have to be logged in to leave a comment. Sign In

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

  • This repo contains code for the paper Mandar Joshi, Eunsol Choi, Daniel Weld, Luke Zettlemoyer.

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension In Association for Computational Linguistics (ACL) 2017, Vancouver, Canada.

  • The data can be downloaded from the TriviaQA website.
  • Please contact Mandar Joshi (<first-name>90@cs.washington.edu) for suggestions and comments.

Requirements

General

  • Python 3. You should be able to run the evaluation scripts using Python 2.7 if you take care of unicode in utils.utils.py.
  • BiDAF requires Python 3 -- check the original repository for more details.

Python Packages

  • tensorflow (only if you want to run BiDAF, verified on r0.11)
  • nltk
  • tqdm

Evaluation

The dataset file parameter refers to files in the qa directory of the data (e.g., wikipedia-dev.json). For file format, check out the sample directory in the repo.

python3 -m evaluation.triviaqa_evaluation --dataset_file samples/triviaqa_sample.json --prediction_file samples/sample_predictions.json

Miscellaneous

  • If you have a SQuAD model and want to run on TriviaQA, please refer to utils.convert_to_squad_format.py
Tip!

Press p or to see the previous file or, n or to see the next file

About

Code for the TriviaQA reading comprehension dataset

Collaborators 4

Comments

Loading...