Everybody Prune Now: Structured Pruning of LLMs with only Forward Passes
Paper
• 2402.05406 • Published
Working on improving reasoning of Bonsai Paper.
Note Original Model with 10 iterations to get 50% sparsity
Note Finetuned Bonsai (pruned on C4) on Wikitext