Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Wakals 's Collections
CoVT: Chain-of-Visual-Thought

CoVT: Chain-of-Visual-Thought

updated Nov 25, 2025

Enrich VLMs’ vision-centric reasoning capabilities via Chain-of-Visual-Thought!

Upvote
6

  • Wakals/CoVT-7B-seg_depth_dino

    8B • Updated about 1 month ago • 528 • 2

  • Wakals/CoVT-7B-seg_depth_dino_edge

    8B • Updated about 1 month ago • 192 • 2

  • Wakals/CoVT-7B-depth

    8B • Updated about 1 month ago • 15 • 2

  • Wakals/CoVT-7B-seg

    8B • Updated about 1 month ago • 44 • 1

  • Wakals/CoVT-LLaVA-13B-depth

    13B • Updated about 1 month ago • 6 • 2

  • Wakals/CoVT-Dataset

    Viewer • Updated about 1 month ago • 1.17M • 3.62k • 9

  • Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens

    Paper • 2511.19418 • Published Nov 24, 2025 • 28
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs