Instructions to use vocab-transformers/distilbert-word2vec_256k-MLM_best with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use vocab-transformers/distilbert-word2vec_256k-MLM_best with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("fill-mask", model="vocab-transformers/distilbert-word2vec_256k-MLM_best")# Load model directly from transformers import AutoTokenizer, AutoModelForMaskedLM tokenizer = AutoTokenizer.from_pretrained("vocab-transformers/distilbert-word2vec_256k-MLM_best") model = AutoModelForMaskedLM.from_pretrained("vocab-transformers/distilbert-word2vec_256k-MLM_best") - Notebooks
- Google Colab
- Kaggle
YAML Metadata Warning:empty or missing yaml metadata in repo card
Check out the documentation for more information.
DistilBERT with word2vec token embeddings
This model has a word2vec token embedding matrix with 256k entries. The word2vec was trained on 100GB data from C4, MSMARCO, News, Wikipedia, S2ORC, for 3 epochs.
Then the model was trained on this dataset with MLM for 1.37M steps (batch size 64). The token embeddings were NOT updated.
For the initial word2vec weights with Gensim see: https://huggingface.co/vocab-transformers/distilbert-word2vec_256k-MLM_1M/tree/main/word2vec
- Downloads last month
- 10