1 Branches

.dvc

c7c0314cfc

added dvc remote

1 year ago

dataset

evaluation

6d8e83cebb

Update readme

7 years ago

utils

b2c0046463

minor bug fixes

7 years ago

.dvcignore

b3f763c4fc

added dataset

1 year ago

.gitignore

b3f763c4fc

added dataset

1 year ago

LICENSE

34a26ae40f

add license

4 years ago

README.md

92ab560e75

remove links

5 years ago

dataset.dvc

b3f763c4fc

added dataset

1 year ago

requirements.txt

7c5f4796b1

Add requirements; update readme

7 years ago

DagsHub Storage

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

Legend
DVC Managed File
Git Managed File
Metric
Stage File
External File

You have to be logged in to leave a comment.

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

This repo contains code for the paper Mandar Joshi, Eunsol Choi, Daniel Weld, Luke Zettlemoyer.

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension In Association for Computational Linguistics (ACL) 2017, Vancouver, Canada.

The data can be downloaded from the TriviaQA website.
Please contact Mandar Joshi (<first-name>90@cs.washington.edu) for suggestions and comments.

Requirements

General

Python 3. You should be able to run the evaluation scripts using Python 2.7 if you take care of unicode in utils.utils.py.
BiDAF requires Python 3 -- check the original repository for more details.

Python Packages

tensorflow (only if you want to run BiDAF, verified on r0.11)
nltk
tqdm

Evaluation

The dataset file parameter refers to files in the qa directory of the data (e.g., wikipedia-dev.json). For file format, check out the sample directory in the repo.

python3 -m evaluation.triviaqa_evaluation --dataset_file samples/triviaqa_sample.json --prediction_file samples/sample_predictions.json

Miscellaneous

If you have a SQuAD model and want to run on TriviaQA, please refer to utils.convert_to_squad_format.py

Tip!

Press p or to see the previous file or, n or to see the next file

README.md

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Requirements

General

Python Packages

Evaluation

Miscellaneous

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

DagsHub / triviaqa connected to https://github.com/jinensetpal/triviaqa.git

README.md

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension

Requirements

General

Python Packages

Evaluation

Miscellaneous

Comments

Use Google Cloud Storage!

Specify your Google Storage bucket

Service Account Key

Congratulations!

Use AWS S3 as storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use any S3 compatible storage!

Specify your S3 bucket

Access key (If needed)

Congratulations!

Use Azure Cloud Storage!

Specify your Azure Storage bucket

Access key (If needed)

Congratulations!

DagsHub
/
triviaqa
connected to https://github.com/jinensetpal/triviaqa.git