AI & ML interests

A one-year long research workshop on large language models: the Summer of Language Models 21 🌸

Recent Activity

christopher 
in bigscience/bloom 27 days ago

[SPAM] Deleted

3
#289 opened 27 days ago by
sarthak-saxena
stas 
posted an update 29 days ago
view post
Post
201
Good news! Ulysses Sequence Parallelism from the Snowflake AI Research and the Deepspeed teams has been integrated into
HuggingFace Trainer, Accelerate and TRL

For extensive details please see this writeup:
https://huggingface.co/blog/ulysses-sp

Thanks a lot to Kashif Rasul for helping make it happen. Also the others in the HF team who helped with integration.
christopher 
in bigscience/bloom about 1 month ago

pretokenizer Regex issues?

8
#278 opened almost 2 years ago by
hpcpony

Test PR

#286 opened about 1 month ago by
FIRSTACCOUNT69

Test discussion

#287 opened about 1 month ago by
FIRSTACCOUNT69

Test discussion

#288 opened about 1 month ago by
FIRSTACCOUNT69