10 46

srivatsa

srivatsa92

devsrivatsa

AI & ML interests

rag, agents, fine-tuning

Recent Activity

liked a model 16 days ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

liked a Space 22 days ago

lm-provers/qed-nano-blogpost

liked a dataset 27 days ago

google/mobile-actions

View all activity

Organizations

liked a model 16 days ago

unsloth/NVIDIA-Nemotron-3-Super-120B-A12B-GGUF

Text Generation • 121B • Updated 13 days ago • 86.5k • 101

liked a Space 22 days ago

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

📝

Who needs 1T parameters? Olympiad proofs with a 4B model

liked a dataset 27 days ago

google/mobile-actions

Viewer • Updated Dec 18, 2025 • 9.65k • 2.04k • 261

liked a model 3 months ago

ai21labs/AI21-Jamba2-3B

Text Generation • Updated Feb 2 • 3.2k • 40

liked a Space 4 months ago

The Smol Training Playbook

📚

3.08k

The secrets to building world-class LLMs

upvoted a collection 5 months ago

SmolVLM

Collection

State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct. Check our blog: https://huggingface.co/blog/smolvlm • 5 items • Updated May 5, 2025 • 42

liked a dataset 5 months ago

bigcode/the-stack

Viewer • Updated Apr 13, 2023 • 546M • 13.8k • 973

upvoted an article 5 months ago

Article

Let's talk about LLM evaluation

May 23, 2024

•

207

liked a Space 5 months ago

Open ASR Leaderboard

🏆

1.29k

Explore speech model benchmarks and request new evaluations

liked a dataset 7 months ago

neerajaabhyankar/hindustani-raag-small

Viewer • Updated Mar 20, 2024 • 1.25k • 993 • 3

upvoted 2 articles 8 months ago

Article

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Apr 16, 2025

•

Article

Efficient Request Queueing – Optimizing LLM Performance

Apr 2, 2025

•

updated a Space 8 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

published a Space 8 months ago

GPU VRAM Estimator

🚀

Estimate VRAM and training time for LLMs

liked a model 9 months ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

Updated Jan 28 • 4.43M • 871

liked 2 datasets 10 months ago

vidore/colpali_train_set

Viewer • Updated Jun 20, 2025 • 119k • 7.82k • 91

llamaindex/vdr-multilingual-train

Viewer • Updated Jan 10, 2025 • 424k • 2.35k • 28

liked 2 models 10 months ago

unsloth/Nanonets-OCR-s-GGUF

Image-Text-to-Text • 3B • Updated Jul 3, 2025 • 2.92k • 61

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 64.2k • 1.59k

upvoted an article 11 months ago

Article

The Transformers Library: standardizing model definitions

May 15, 2025

•

121

srivatsa

AI & ML interests

Recent Activity

Organizations

srivatsa92's activity

QED-Nano: Teaching a Tiny Model to Prove Hard Theorems

The Smol Training Playbook

Let's talk about LLM evaluation

Open ASR Leaderboard

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Efficient Request Queueing – Optimizing LLM Performance

GPU VRAM Estimator

GPU VRAM Estimator

The Transformers Library: standardizing model definitions