Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
PaddlePaddle 's Collections
PaddleOCR-VL-1.5
PaddleOCR-VL
PP-StructureV3
PP-OCRv5
PP-OCRv4
PP-OCRv3

PaddleOCR-VL-1.5

updated 9 days ago

Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Upvote
10

  • PaddlePaddle/PaddleOCR-VL-1.5

    Image-Text-to-Text • 1.0B • Updated 9 days ago • 22.3k • 468

  • Running
    Featured
    66

    PaddleOCR-VL-1.5 Online Demo

    😻
    66

    PaddleOCR-VL-1.5_Online_Demo


  • PaddlePaddle/PP-DocLayoutV3

    Image Segmentation • Updated Jan 30 • 16.6k • 54

  • PaddlePaddle/PP-DocLayoutV3_safetensors

    Object Detection • Updated Feb 10 • 214k • 19

  • PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

    Paper • 2601.21957 • Published Jan 29 • 19

  • PaddlePaddle/Real5-OmniDocBench

    Viewer • Updated 4 days ago • 2.8k • 8.25k • 6

  • PaddlePaddle/PaddleOCR-VL-1.5-GGUF

    0.5B • Updated 9 days ago • 1.24k • 7
Upvote
10
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs