MOSS-Audio Collection An open-source audio understanding model supporting speech recognition, environmental sound analysis, music understanding, time-aware QA, and complex • 5 items • Updated about 10 hours ago • 37
TIPSv2 Collection TIPSv2 foundational vision-language models. Webpage: https://gdm-tipsv2.github.io/ • 9 items • Updated 6 days ago • 17
ERNIE-Image Collection A series of image generation models, including text2img and img2img. • 2 items • Updated 6 days ago • 22
DFlash Collection Block Diffusion for Flash Speculative Decoding • 14 items • Updated 4 days ago • 66
WildDet3D Collection This is the collection of WildDet3D artifacts, including demos, model checkpoints and data. https://github.com/allenai/WildDet3D • 8 items • Updated 7 days ago • 17
Marco-MoE Collection A suite of multilingual MoE models with highly sparse architectures • 5 items • Updated 13 days ago • 15