Image-Text-to-Text
PaddleOCR
Safetensors
English
Chinese
multilingual
paddleocr_vl
ERNIE4.5
PaddlePaddle
image-to-text
ocr
document-parse
layout
table
formula
chart
conversational
custom_code
Eval Results
Instructions to use PaddlePaddle/PaddleOCR-VL with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PaddleOCR
How to use PaddlePaddle/PaddleOCR-VL with PaddleOCR:
# See https://www.paddleocr.ai/latest/version3.x/pipeline_usage/PaddleOCR-VL.html to installation from paddleocr import PaddleOCRVL pipeline = PaddleOCRVL(pipeline_version="v1") output = pipeline.predict("path/to/document_image.png") for res in output: res.print() res.save_to_json(save_path="output") res.save_to_markdown(save_path="output") - Notebooks
- Google Colab
- Kaggle
Update tokenizer_config.json
Browse files- tokenizer_config.json +2 -2
tokenizer_config.json
CHANGED
|
@@ -8324,7 +8324,7 @@
|
|
| 8324 |
"<|video_pad|>"
|
| 8325 |
],
|
| 8326 |
"auto_map": {
|
| 8327 |
-
"AutoProcessor": "
|
| 8328 |
},
|
| 8329 |
"bos_token": "<s>",
|
| 8330 |
"clean_up_tokenization_spaces": false,
|
|
@@ -8336,7 +8336,7 @@
|
|
| 8336 |
"mask_token": "<mask:1>",
|
| 8337 |
"model_max_length": 131072,
|
| 8338 |
"pad_token": "<unk>",
|
| 8339 |
-
"processor_class": "
|
| 8340 |
"sep_token": "<|end_of_sentence|>",
|
| 8341 |
"sp_model_kwargs": {},
|
| 8342 |
"spaces_between_special_tokens": false,
|
|
|
|
| 8324 |
"<|video_pad|>"
|
| 8325 |
],
|
| 8326 |
"auto_map": {
|
| 8327 |
+
"AutoProcessor": "processing_paddleocr_vl.PaddleOCRVLProcessor"
|
| 8328 |
},
|
| 8329 |
"bos_token": "<s>",
|
| 8330 |
"clean_up_tokenization_spaces": false,
|
|
|
|
| 8336 |
"mask_token": "<mask:1>",
|
| 8337 |
"model_max_length": 131072,
|
| 8338 |
"pad_token": "<unk>",
|
| 8339 |
+
"processor_class": "PaddleOCRVLProcessor",
|
| 8340 |
"sep_token": "<|end_of_sentence|>",
|
| 8341 |
"sp_model_kwargs": {},
|
| 8342 |
"spaces_between_special_tokens": false,
|