Scialom et al. (2021)#
Publication#
Repositories#
https://github.com/ThomasScialom/QuestEval
Available Models#
This implementation wraps the QuestEval
metric.
QuestEval
Description: A QA-based metric that uses an optional source document and/or reference
Name:
scialom2021-questeval
Usage:
from repro.models.scialom2021 import QuestEval model = QuestEval() # Either "references" and/or "sources" must be present. Only supports single # references/documents inputs = [ {"candidate": "The candidate", "references": ["The reference"], "sources": ["The source"]}, ... ] macro, micro = model.predict_batch(inputs)
macro
is the averaged QuestEval scores over the inputs, andmicro
is the individual scores per input. Any**kwargs
passed topredict
orpredict_batch
will be passed to the constructor of the QuestEval metric in the original code.
QuestEvalForSummarization
Description: A wrapper around
QuestEval
which passes the arguments specific to summarization to the original code’s QuestEval constructor by default.Name:
scialom2021-questeval-summarization
Usage:
from repro.models.scialom2021 import QuestEvalForSummarization model = QuestEvalForSummarization() # Either "references" and/or "sources" must be present. Only supports single # references/documents inputs = [ {"candidate": "The candidate", "references": ["The reference"], "sources": ["The source"]}, ... ] macro, micro = model.predict_batch(inputs)
QuestEvalForSimplification
Description: A wrapper around
QuestEval
which passes the arguments specific to simplification to the original code’s QuestEval constructor by default.Name:
scialom2021-questeval-simplification
Usage:
from repro.models.scialom2021 import QuestEvalForSimplification model = QuestEvalForSimplification() # Either "references" and/or "sources" must be present. Only supports single # references/documents inputs = [ {"candidate": "The candidate", "references": ["The reference"], "sources": ["The source"]}, ... ] macro, micro = model.predict_batch(inputs)
Implementation Notes#
This implementation is based on the
v0.0.1
tag of the QuestEval repro based on the authors’ recommendation for the correct API for the summarization task.
Docker Information#
Image name:
scialom2021
Build command:
repro setup scialom2021 [--silent]
Requires network: No
Testing#
Setting TEST_DEVICES=<gpu-id>
is required for the tests.
repro setup scialom2021
pytest models/scialom2021/tests
Status#
[x] Regression unit tests pass
[x] Correctness unit tests pass
We have reproduced the examples from the original code’s Github repo.[ ] Model runs on full test dataset
Not tested[ ] Predictions approximately replicate results reported in the paper
Not tested[ ] Predictions exactly replicate results reported in the paper
Not tested