Running 3.6k The Ultra-Scale Playbook 🌌 3.6k The ultimate guide to training LLM on large GPU Clusters
Intel/distilbert-base-uncased-sparse-90-unstructured-pruneofa Fill-Mask • Updated Apr 11, 2023 • 18 • 2