deepset
/

roberta-base-squad2-distilled

Question Answering

Inference Endpoints

Model card Files Files and versions Community

Edit model card

Overview

Language model: deepset/roberta-base-squad2-distilled
Language: English
Training data: SQuAD 2.0 training set Eval data: SQuAD 2.0 dev set Infrastructure: 4x V100 GPU
Published: Dec 8th, 2021

Details

haystack's distillation feature was used for training. deepset/roberta-large-squad2 was used as the teacher model.

Hyperparameters

batch_size = 80
n_epochs = 4
max_seq_len = 384
learning_rate = 3e-5
lr_schedule = LinearWarmup
embeds_dropout_prob = 0.1
temperature = 1.5
distillation_loss_weight = 0.75

Performance

"exact": 79.8366040596311
"f1": 83.916407079888

Authors

Timo Möller: timo.moeller@deepset.ai
Julian Risch: julian.risch@deepset.ai
Malte Pietsch: malte.pietsch@deepset.ai
Michel Bartels: michel.bartels@deepset.ai

About us

deepset is the company behind the open-source NLP framework Haystack which is designed to help you build production ready NLP systems that use: Question answering, summarization, ranking etc.

Some of our other work:

Get in touch and join the Haystack community

For more info on Haystack, visit our GitHub repo and Documentation.

We also have a Discord community open to everyone!

Twitter | LinkedIn | Discord | GitHub Discussions | Website

By the way: we're hiring!

Downloads last month: 14,499

Safetensors

Model size

124M params

Tensor type

I64

·

F32

·

Dataset used to train deepset/roberta-base-squad2-distilled

Spaces using deepset/roberta-base-squad2-distilled 6

Evaluation results

Exact Match on squad_v2
validation set verified

80.859
F1 on squad_v2
validation set verified

84.010
Exact Match on squad
validation set self-reported

86.225
F1 on squad
validation set self-reported

92.483
Exact Match on adversarial_qa
validation set self-reported

29.900
F1 on adversarial_qa
validation set self-reported

41.183
Exact Match on squad_adversarial
validation set self-reported

79.071
F1 on squad_adversarial
validation set self-reported

84.472
Exact Match on squadshifts amazon
test set self-reported

70.733
F1 on squadshifts amazon
test set self-reported

83.958
Exact Match on squadshifts new_wiki
test set self-reported

82.011
F1 on squadshifts new_wiki
test set self-reported

91.092
Exact Match on squadshifts nyt
test set self-reported

84.203
F1 on squadshifts nyt
test set self-reported

91.521
Exact Match on squadshifts reddit
test set self-reported

72.029
F1 on squadshifts reddit
test set self-reported

83.454

View on Papers With Code