🧠 Why does DeepSeek-OCR not use Multi-Head Latent Attention (MLA)?

#53

by ZoneTwelve - opened Oct 28, 2025

Hi DeepSeek team 👋,

First of all, thank you for releasing DeepSeek-OCR — it’s an impressive and elegant vision-to-text model.

While exploring the model architecture and configuration files (config.json), I noticed that Multi-Head Latent Attention (MLA) is not enabled by default in this OCR model.

Questions

Could you please share some insights into why MLA was not used in DeepSeek-OCR?

Was it due to compatibility issues between MLA and the vision encoder–decoder pipeline?
Or did MLA not provide practical benefits in the OCR setting (e.g., shorter sequence lengths or the main bottleneck lying elsewhere)?
Is there any plan to integrate MLA into future versions of DeepSeek-OCR to improve inference efficiency?

I’m asking because MLA has demonstrated significant efficiency gains in your other models (e.g., DeepSeek-V2/V3), and I’m curious about the reasoning behind excluding it here.

Thanks again for your excellent work and for open-sourcing this project! 🙏
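
For context on the efficiency gains mentioned in the question, here is a rough sketch of why MLA shrinks the KV cache. It assumes the dimensions published for DeepSeek-V2 (128 heads, head dimension 128, KV latent rank 512, decoupled RoPE key dimension 64). These are not DeepSeek-OCR's values, and the function names are made up for illustration.

```python
def kv_cache_elems_per_token_mha(num_heads: int, head_dim: int) -> int:
    """Standard multi-head attention caches a full key and a full value per head."""
    return 2 * num_heads * head_dim


def kv_cache_elems_per_token_mla(kv_lora_rank: int, rope_head_dim: int) -> int:
    """MLA caches one compressed KV latent plus one shared decoupled RoPE key."""
    return kv_lora_rank + rope_head_dim


# Illustrative dimensions as published for DeepSeek-V2 (assumption: NOT DeepSeek-OCR).
mha = kv_cache_elems_per_token_mha(num_heads=128, head_dim=128)
mla = kv_cache_elems_per_token_mla(kv_lora_rank=512, rope_head_dim=64)

# Cached elements per token, per layer, and the resulting reduction factor.
print(f"MHA: {mha}, MLA: {mla}, reduction: {mha / mla:.1f}x")
```

With shorter sequences the absolute saving is smaller, which is one reason the benefit in an OCR setting is worth asking about.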

ZoneTwelve changed discussion title from Why does DeepSeek-OCR not use Multi-Head Latent Attention (MLA)? to 🧠 Why does DeepSeek-OCR not use Multi-Head Latent Attention (MLA)? Oct 28, 2025

HaoranWei

Oct 28, 2025

Hello,
We actually have an internal MLA-enabled version of DeepSeek-OCR.
The only reason it hasn’t been open-sourced yet is simply that I haven’t had the bandwidth to implement the code needed to convert the internal weights into the Hugging Face format.
Best regards

Excel001

Oct 28, 2025

ali4566544

Nov 2, 2025

Hello Sir, I am currently in school (8th grade, from Pakistan) doing basic ML. Any suggestions for me?

vihaan134354

Nov 5, 2025

Hello Sir, I am currently in school (8th grade, from Pakistan) doing basic ML. Any suggestions for me?

Hey bro, I'd suggest building a bunch of neural networks on your own: design your own architectures, experiment, and train models. Keep at it and you'll eventually get a perfect one.

Rashworld

Nov 8, 2025
