Visual Question Answering
Transformers
Safetensors
English
Chinese
qwen2_vl
image-text-to-text
multimodal
text-generation-inference
Instructions to use Cylingo/Xinyuan-VL-2B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use Cylingo/Xinyuan-VL-2B with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("visual-question-answering", model="Cylingo/Xinyuan-VL-2B")# Load model directly from transformers import AutoProcessor, AutoModelForImageTextToText processor = AutoProcessor.from_pretrained("Cylingo/Xinyuan-VL-2B") model = AutoModelForImageTextToText.from_pretrained("Cylingo/Xinyuan-VL-2B") - Notebooks
- Google Colab
- Kaggle
- Xet hash:
- 63fd9278d30134757a8fb9841a9c3d31a81c15fcacbda78bad7c3c93c2e00363
- Size of remote file:
- 7.03 kB
- SHA256:
- f5973d5c2c89a6876c0ef8c7738e7563fdc51bc40e9241cb5ec6620f91da03ca
·
Xet efficiently stores Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads. More info.