LatentMem-Qwen3-4B

This is the official repository for LatentMem-Qwen3-4B, based on the Qwen3-4B backbone and optimized using Latent Memory Policy Optimization (LMPO).

LatentMem is a learnable multi-agent memory framework designed to customize agent-specific memories in a token-efficient manner. It comprises an experience bank that stores raw interaction trajectories and a memory composer that synthesizes compact latent memories conditioned on retrieved experience and agent-specific contexts.

Resources

πŸ“‚ Repository Structure

The repository is organized into two primary directories:

  • data/: Contains the Raw Trajectories collected during the initial data collection phase. These include state-action pairs, environmental feedback, and intermediate reasoning steps used for training.
  • model/: Contains the LatentMem Weights. These are the final weights after undergoing LMPO.

Citation

If you find this work helpful, please consider citing the paper:

@misc{fu2026latentmemcustomizinglatentmemory,
      title={LatentMem: Customizing Latent Memory for Multi-Agent Systems}, 
      author={Muxin Fu and Guibin Zhang and Xiangyuan Xue and Yafu Li and Zefeng He and Siyuan Huang and Xiaoye Qu and Yu Cheng and Yang Yang},
      year={2026},
      eprint={2602.03036},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2602.03036}, 
}
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Model tree for Kana-s/LatentMem-Qwen3-4B

Finetuned
(1728)
this model

Paper for Kana-s/LatentMem-Qwen3-4B