4 14 170

Turbo Pascal

TurboPascal

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

liked a model 6 days ago

google/gemma-3-27b-it

new activity 26 days ago

Alibaba-NLP/new-impl:torch.AcceleratorError: CUDA error: device-side assert triggered

View all activity

Organizations

upvoted a paper about 16 hours ago

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published Jan 30 • 110

liked a model 6 days ago

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21, 2025 • 1.18M • • 1.93k

New activity in Alibaba-NLP/new-impl 26 days ago

torch.AcceleratorError: CUDA error: device-side assert triggered

#14 opened 26 days ago by

TurboPascal

liked a model 5 months ago

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • 0.3B • Updated Apr 8, 2025 • 295k • 344

upvoted an article 6 months ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

Mar 26, 2025

•

185

liked a model 7 months ago

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • Updated Aug 26, 2025 • 24.6k • 492

upvoted a collection 7 months ago

BGE

Collection

31 items • Updated Feb 4 • 150

liked a dataset 7 months ago

HuggingFaceTB/smoltalk2

Viewer • Updated Oct 31, 2025 • 8.61M • 10.6k • 145

liked 2 models 7 months ago

Alibaba-NLP/WebDancer-32B

Text Generation • Updated Jun 26, 2025 • 11 • • 57

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated Oct 25, 2025 • 46.1k • • 710

upvoted a paper 9 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2, 2025 • 58

liked a Space 9 months ago

The Ultra-Scale Playbook

🌌

3.75k

The ultimate guide to training LLM on large GPU Clusters

liked a model 9 months ago

Qwen/Qwen3-Reranker-0.6B

Text Ranking • 0.6B • Updated Jun 9, 2025 • 661k • 324

liked a model 10 months ago

Qwen/Qwen3-Embedding-0.6B

Feature Extraction • 0.6B • Updated Jun 20, 2025 • 5.1M • • 939

upvoted a collection 10 months ago

GTE models

Collection

General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 20 items • Updated 22 days ago • 36

updated a dataset 10 months ago

Mmoment/Mirage_Multimodal_Benchmark

Updated May 15, 2025 • 24

liked a dataset 11 months ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 25.1k • 1.68k

upvoted 2 papers 12 months ago

AdaMMS: Model Merging for Heterogeneous Multimodal Large Language Models with Unsupervised Coefficient Optimization

Paper • 2503.23733 • Published Mar 31, 2025 • 10

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published Mar 28, 2025 • 45

liked a model 12 months ago

nvidia/NV-Embed-v2

Feature Extraction • 8B • Updated Jul 21, 2025 • 54.7k • 506