How to use HHousen/distil-led-large-cnn-16384 with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
tokenizer = AutoTokenizer.from_pretrained("HHousen/distil-led-large-cnn-16384")
model = AutoModelForSeq2SeqLM.from_pretrained("HHousen/distil-led-large-cnn-16384")

distil-led-large-cnn-16384 was initialized from sshleifer/distilbart-cnn-12-6, in a fashion similar to allenai/led-large-16384.
To process 16K (16,384) tokens, the position embedding matrix of sshleifer/distilbart-cnn-12-6, which covers 1,024 positions, was simply copied 16 times.
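As a rough illustration of the copying step, the sketch below tiles a toy position embedding matrix 16 times along the position axis. The function name `tile_position_embeddings` and the toy dimensions (4 positions, hidden size 3) are assumptions made for this example. The actual conversion worked on the real 1,024-position matrix and may have handled model-specific details that are not shown here.

```python
def tile_position_embeddings(pos_emb, copies):
    """Repeat a (positions x hidden) matrix `copies` times along the position axis."""
    tiled = []
    for _ in range(copies):
        # Copy each row so the tiled matrix does not alias the original rows.
        tiled.extend(list(row) for row in pos_emb)
    return tiled


# Toy stand-in for the real matrix: 4 positions, hidden size 3 (hypothetical values).
toy_pos_emb = [[float(p), p + 0.1, p + 0.2] for p in range(4)]

extended = tile_position_embeddings(toy_pos_emb, copies=16)
print(len(extended), len(extended[0]))  # 64 3
```

With the real sizes, the same operation turns 1,024 positions into 16 x 1,024 = 16,384 positions, so position 1,024 starts out with the same embedding as position 0.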
Load this checkpoint with LEDForConditionalGeneration.from_pretrained. See the LED documentation for more information.
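A minimal sketch of summarizing with this checkpoint through LEDForConditionalGeneration might look like the following. It puts global attention on the first token only, as the LED documentation suggests for summarization. The helper names `first_token_global_mask` and `summarize` are hypothetical, and the generation settings (`num_beams`, `max_length`) are assumed values, not ones taken from this model card.

```python
def first_token_global_mask(seq_len):
    """Return a 0/1 list that puts global attention on the first token only."""
    return [1] + [0] * (seq_len - 1)


def summarize(article, model_name="HHousen/distil-led-large-cnn-16384"):
    """Summarize `article` with the LED checkpoint (downloads weights on first use)."""
    # Imported here so the mask helper above works without the heavy dependencies.
    import torch
    from transformers import AutoTokenizer, LEDForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = LEDForConditionalGeneration.from_pretrained(model_name)

    # Truncate to the 16,384-token window the checkpoint supports.
    inputs = tokenizer(article, return_tensors="pt", truncation=True, max_length=16384)
    seq_len = inputs["input_ids"].shape[1]
    global_attention_mask = torch.tensor([first_token_global_mask(seq_len)])

    summary_ids = model.generate(
        inputs["input_ids"],
        attention_mask=inputs["attention_mask"],
        global_attention_mask=global_attention_mask,
        num_beams=4,     # assumed value, not from the model card
        max_length=142,  # assumed value, not from the model card
    )
    return tokenizer.decode(summary_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` with a long news article returns the generated summary as a string. All other tokens keep LED's default local (windowed) attention.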