SQuAD v2#

Publication#

Know What You Don’t Know: Unanswerable Questions for SQuAD

Repositories#

https://worksheets.codalab.org/worksheets/0xbe2859a20b9e41d2a2b63ea11bd97740

Available Models#

This implementation wraps the evaluation for SQuAD v2 on the official dev set. The dev set is equivalent to predicting on the “validation” split using the HuggingfaceDatasetsDatasetReader with the dataset_name="squad_v2".

  • Evaluation

    • Description: The SQuAD-v2 evaluation script for the dev set.

    • Name: squad-v2

    • Usage:

      from repro.models.squad_v2 import SQuADv2Evaluation
      model = SQuADv2Evaluation()
      inputs = [
          {"instance_id": "56ddde6b9a695914005b9628", "prediction": "France", "null_probability": 4.3727909708190646e-07}
      ]
      metrics = model.predict_batch(inputs)
      

Implementation Notes#

Docker Information#

  • Image name: squad-v2

  • Build command:

    repro setup squad-v2 [--silent]
    
  • Requires network: No

Testing#

n/a

Status#

Appears to work as expected.