CKeibel's Collections
Multimodal
Updated Apr 15, 2025
HuggingFaceM4/idefics-80b-instruct • Text Generation • 80B • Updated Oct 12, 2023 • 4.46k • 188
liuhaotian/llava-v1.5-13b • Image-Text-to-Text • Updated May 9, 2024 • 27.4k • 528
llava-hf/llava-v1.6-34b-hf • Image-Text-to-Text • 35B • Updated Jan 27, 2025 • 3.86k • 94
HuggingFaceM4/idefics2-8b • Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 120k • 623
microsoft/Phi-3-vision-128k-instruct • Text Generation • 4B • Updated Dec 10, 2025 • 263k • 971
google/paligemma-3b-pt-224 • Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 562k • 456
jinaai/jina-clip-v1 • Feature Extraction • 0.2B • Updated Apr 8 • 74.3k • 256
Qwen/Qwen2-VL-2B-Instruct • Image-Text-to-Text • 2B • Updated Jan 12, 2025 • 3.8M • 506
llamaindex/vdr-2b-multi-v1 • Image-Text-to-Text • 2B • Updated Apr 8 • 931 • 128