A GPT-4V Level Multimodal LLM on Your Phone
chongyi
yuzaa
AI & ML interests
multimodal large language models
Recent Activity
Updated a model 8 days ago
openbmb/MiniCPM-V-4_5
Authored a paper 3 months ago
AgentCPM-GUI: Building Mobile-Use Agents with Reinforcement Fine-Tuning
Authored a paper 3 months ago
MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe