abisee/cnn_dailymail
Viewer • Updated • 936k • 180k • 344
How to use chunwoolee0/cnn_dailymail_t5_small with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("chunwoolee0/cnn_dailymail_t5_small")
model = AutoModelForSeq2SeqLM.from_pretrained("chunwoolee0/cnn_dailymail_t5_small")This model is a fine-tuned version of t5-small on the cnn_dailymail dataset. It achieves the following results on the evaluation set:
Text-To-Text Transfer Transformer (T5) T5-Small is the checkpoint with 60 million parameters.
This is an exercise for finetuning of pretrained t5 model.
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|---|---|---|---|---|---|---|---|---|
| 1.9158 | 1.0 | 10000 | 1.7333 | 0.2313 | 0.0948 | 0.1879 | 0.1879 | 18.9998 |
| 1.9316 | 2.0 | 20000 | 1.7271 | 0.2321 | 0.0955 | 0.1887 | 0.1887 | 18.9998 |
Base model
google-t5/t5-small