Hugging Face
62.8
TFLOPS
91
3
37
hai
cloudyu
165 followers
·
44 following
yu-hai-52a1702a
AI & ML interests
Looking for a full time job.
Recent Activity
new
activity
4 days ago
cloudyu/Mixtral_34Bx2_MoE_60B:
Update README.md
updated
a model
3 months ago
cloudyu/quant_signal
published
a model
3 months ago
cloudyu/quant_signal
Organizations
cloudyu
's activity
New activity in
cloudyu/Mixtral_34Bx2_MoE_60B
4 days ago
Update README.md
#17 opened 4 days ago by
cherry0328
updated
a model
3 months ago
cloudyu/quant_signal
Updated
Oct 10, 2025
published
a model
3 months ago
cloudyu/quant_signal
Updated
Oct 10, 2025
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
3 months ago
Does this model really have to be updated right before the National Day holiday??
😔
👍
113
31
#1 opened 3 months ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.1-Terminus
3 months ago
National Day is a holiday; please give those of us following along some time to rest
👀
👍
64
1
#10 opened 3 months ago by
luckjone
New activity in
deepseek-ai/DeepSeek-V3.2-Exp
3 months ago
Transformers does not recognize this architecture
6
#6 opened 3 months ago by
eva20150932-atlascloud
liked
a model
3 months ago
deepseek-ai/DeepSeek-V3.2-Exp
Text Generation
•
685B
•
Updated
Nov 18, 2025
•
67.8k
•
•
932
New activity in
unsloth/grok-2-GGUF
4 months ago
mac studio : loading model vocabulary: unknown pre-tokenizer type: 'grok-2'
#5 opened 4 months ago by
cloudyu
New activity in
Wan-AI/Wan2.2-T2V-A14B-Diffusers
5 months ago
Could you run the demo yourselves and only publish it once it works?
#8 opened 5 months ago by
cloudyu
New activity in
ByteDance-Seed/Seed-OSS-36B-Instruct
5 months ago
Why is the chat_template mixed with Chinese and English?
👍
2
5
#8 opened 5 months ago by
Daucloud
updated
a model
7 months ago
cloudyu/Deep-Think-32B
33B
•
Updated
Jun 18, 2025
•
7
published
a model
7 months ago
cloudyu/Deep-Think-32B
33B
•
Updated
Jun 18, 2025
•
7
New activity in
onnx-community/Qwen3-1.7B-ONNX
8 months ago
Please share how to export Qwen3 to ONNX format, many thanks!
👍
1
2
#1 opened 8 months ago by
cloudyu
liked
a model
9 months ago
nvidia/OpenMath-Nemotron-14B-Kaggle
Text Generation
•
15B
•
Updated
May 29, 2025
•
251
•
•
18
New activity in
Qwen/QwQ-32B
10 months ago
It's challenging for QwQ to generate long codes...
2
#38 opened 10 months ago by
DXBTR74
updated
a model
11 months ago
cloudyu/S1-Llama-3.2-3Bx4-MoE
10B
•
Updated
Feb 5, 2025
•
6
published
a model
11 months ago
cloudyu/S1-Llama-3.2-3Bx4-MoE
10B
•
Updated
Feb 5, 2025
•
6
New activity in
unsloth/DeepSeek-R1-Distill-Qwen-32B-GGUF
12 months ago
Error when trying this GGUF
👀
1
3
#3 opened 12 months ago by
cloudyu
New activity in
unsloth/DeepSeek-R1-Distill-Llama-8B-GGUF
12 months ago
unknown pre-tokenizer type: 'deepseek-r1-qwen'
👍
4
2
#1 opened 12 months ago by
Neman
updated
a model
about 1 year ago
cloudyu/Nemo-DPO-V23
Text Generation
•
12B
•
Updated
Jan 10, 2025
•
2
•
1