Collection: mLLMs_merging_4_DMO. Official checkpoints from the paper "Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization" (21 items).
This is an official checkpoint from the paper "Linear Model Merging Unlocks Simple and Scalable Multimodal Data Mixture Optimization" (link). See the official implementation for more information on how to use the models.
This repo contains fine-tuned versions of OpenGVLab/InternVL3_5-2B-Pretrained-HF on diverse dataset mixtures of Chart, Counting, GeneralVQA, and OCR data (~100k samples).
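For readers unfamiliar with the term in the paper title, linear model merging generally means taking a weighted average of the parameters of several checkpoints that share one architecture. The snippet below is a minimal, framework-free sketch of that general idea, not the paper's implementation: the function name `linear_merge`, the toy parameter dictionaries, and the merge weights are all hypothetical, and real checkpoints would hold tensors rather than Python lists.

```python
from typing import Dict, List, Sequence

# Toy stand-in for a state dict: parameter name -> flat list of values.
Params = Dict[str, List[float]]


def linear_merge(checkpoints: Sequence[Params], weights: Sequence[float]) -> Params:
    """Return the weighted average of several same-shaped parameter dicts.

    Weights are normalized to sum to 1, so they can be read as mixture
    proportions. This is an illustrative sketch only.
    """
    if not checkpoints or len(checkpoints) != len(weights):
        raise ValueError("need one weight per checkpoint")
    total = sum(weights)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    norm = [w / total for w in weights]

    names = set(checkpoints[0])
    if any(set(ckpt) != names for ckpt in checkpoints):
        raise ValueError("checkpoints must share the same parameter names")

    merged: Params = {}
    for name in checkpoints[0]:
        tensors = [ckpt[name] for ckpt in checkpoints]
        size = len(tensors[0])
        if any(len(t) != size for t in tensors):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted sum across checkpoints.
        merged[name] = [
            sum(w * t[i] for w, t in zip(norm, tensors)) for i in range(size)
        ]
    return merged


# Hypothetical example: two single-domain checkpoints merged 25% / 75%.
chart_ckpt = {"layer.weight": [1.0, 2.0]}
ocr_ckpt = {"layer.weight": [3.0, 6.0]}
merged_ckpt = linear_merge([chart_ckpt, ocr_ckpt], [0.25, 0.75])
```

In this toy setup the merge weights play the role of data-mixture proportions; whether and how that correspondence holds for the checkpoints in this repo is described in the paper and the official implementation.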
The following hyperparameters were used during training:
Base model: OpenGVLab/InternVL3_5-2B-Pretrained