Scialom et al. (2019)#

Publication#

Answers Unite! Unsupervised Metrics for Reinforced Summarization Models

Repositories#

https://github.com/ThomasScialom/summa-qa

Available Models#

This implementation wraps the SummaQA metric.

  • SummaQA

    • Description: A reference-free QA-based metric

    • Name: scialom2019-summaqa

    • Usage:

      from repro.models.scialom2019 import SummaQA
      model = SummaQA()
      inputs = [
          {"candidate": "The candidate", "references": ["The reference"], "sources": ["The source"]},
          ...
      ]   
      macro, micro = model.predict_batch(inputs)
      

      macro is the averaged SummaQA scores over the inputs, and micro is the individual scores per input.

Implementation Notes#

  • The metric only supports 1 source document, so the length of sources must be 1.

  • The metric does not support using the GPU

Docker Information#

  • Image name: scialom2019

  • Build command:

    repro setup scialom2019 [--silent]
    
  • Requires network: No

Testing#

repro setup scialom2019
pytest models/scialom2019/tests

Status#

  • [x] Regression unit tests pass

  • [x] Correctness unit tests pass
    See here. We have reproduced the examples from the original code’s Github repo.

  • [ ] Model runs on full test dataset
    Not tested

  • [ ] Predictions approximately replicate results reported in the paper
    Not tested

  • [ ] Predictions exactly replicate results reported in the paper
    Not tested