tner/bc5cdr
Viewer • Updated • 16.4k • 1.93k • 5
How to use sschet/scibert_scivocab_uncased-finetuned-ner with Transformers:
# Use a pipeline as a high-level helper
from transformers import pipeline
pipe = pipeline("token-classification", model="sschet/scibert_scivocab_uncased-finetuned-ner") # Load model directly
from transformers import AutoTokenizer, AutoModelForTokenClassification
tokenizer = AutoTokenizer.from_pretrained("sschet/scibert_scivocab_uncased-finetuned-ner")
model = AutoModelForTokenClassification.from_pretrained("sschet/scibert_scivocab_uncased-finetuned-ner")This is a SciBERT-based model fine-tuned to perform Named Entity Recognition for drug names and adverse drug effects.

This model classifies input tokens into one of five classes:
B-DRUG: beginning of a drug entityI-DRUG: within a drug entityB-EFFECT: beginning of an AE entityI-EFFECT: within an AE entityO: outside either of the above entitiesTo get started using this model for inference, simply set up an NER pipeline like below:
from transformers import (AutoModelForTokenClassification,
AutoTokenizer,
pipeline,
)
model_checkpoint = "jsylee/scibert_scivocab_uncased-finetuned-ner"
model = AutoModelForTokenClassification.from_pretrained(model_checkpoint, num_labels=5,
id2label={0: 'O', 1: 'B-DRUG', 2: 'I-DRUG', 3: 'B-EFFECT', 4: 'I-EFFECT'}
)
tokenizer = AutoTokenizer.from_pretrained(model_checkpoint)
model_pipeline = pipeline(task="ner", model=model, tokenizer=tokenizer)
print( model_pipeline ("Abortion, miscarriage or uterine hemorrhage associated with misoprostol (Cytotec), a labor-inducing drug."))
SciBERT: https://huggingface.co/allenai/scibert_scivocab_uncased