diarizers-community/voxconverse
Viewer • Updated • 448 • 1.79k • 9
How to use tgrhn/speaker-segmentation-fine-tuned-voxconverse-en with Transformers:
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("tgrhn/speaker-segmentation-fine-tuned-voxconverse-en", dtype="auto")This model is a fine-tuned version of pyannote/segmentation-3.0 on the diarizers-community/voxconverse dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
| Training Loss | Epoch | Step | Validation Loss | Der | False Alarm | Missed Detection | Confusion |
|---|---|---|---|---|---|---|---|
| 0.9302 | 1.0 | 791 | 0.9903 | 0.6790 | 0.5013 | 0.0965 | 0.0812 |
| 0.8848 | 2.0 | 1582 | 1.0536 | 0.7965 | 0.3991 | 0.3409 | 0.0565 |
| 0.8513 | 3.0 | 2373 | 1.0884 | 0.8114 | 0.4017 | 0.3528 | 0.0569 |
| 0.7926 | 4.0 | 3164 | 1.1292 | 0.8378 | 0.3660 | 0.4219 | 0.0500 |
| 0.8147 | 5.0 | 3955 | 1.1250 | 0.8257 | 0.3733 | 0.3995 | 0.0528 |
Base model
pyannote/segmentation-3.0