Image Datasets Our frontier vision data! takara-ai/image_captions Viewer • Updated Feb 11, 2025 • 1.07M • 672 • 21 takara-ai/MovieStills_Captioned_SmolVLM Viewer • Updated Feb 25, 2025 • 74.9k • 83
3D 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 15
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 15
Synthetic Data Generation A collection of research papers, datasets and models focused on data generation by machines. proj-persona/PersonaHub Viewer • Updated Sep 26, 2025 • 375k • 13.5k • 689
Foundational Vision microsoft/Florence-2-large Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 748k • 1.73k
Model Security Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 12
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 12
Small LLM’s HuggingFaceTB/SmolLM-135M-Instruct Text Generation • 0.1B • Updated Sep 4, 2024 • 12.2k • 129 HuggingFaceTB/SmolLM-360M-Instruct Text Generation • 0.4B • Updated Aug 18, 2024 • 4.85k • 83 HuggingFaceTB/SmolLM-1.7B-Instruct Text Generation • 2B • Updated Aug 18, 2024 • 3.44k • 118
MultiModal Running on Zero Featured 816 Florence 2 📉 816 Generate captions and analyze images with various tasks
Running on Zero Featured 816 Florence 2 📉 816 Generate captions and analyze images with various tasks
SwarmFormer Our collection of our frontier SwarmFormer architecture models. takara-ai/SwarmFormer-Sentiment-Small Updated Jun 21, 2025 • 9 • 5 takara-ai/SwarmFormer-Sentiment-Base Updated Jun 21, 2025 • 12 • 5
Medical Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 26
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 26
LLM Performance Papers, tools and techniques on maximising LLM performance both in production and in training. facebook/multi-token-prediction Updated Jun 18, 2024 • 371 Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 33
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 33
LLM Scaling LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 8, 2024 • 36 Associative Recurrent Memory Transformer Paper • 2407.04841 • Published Jul 5, 2024 • 35
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 8, 2024 • 36
Large LLM's meta-llama/Llama-3.1-405B-Instruct Text Generation • 406B • Updated Sep 25, 2024 • 148k • • 587
Autonomous Agents The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127 CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024
Image Datasets Our frontier vision data! takara-ai/image_captions Viewer • Updated Feb 11, 2025 • 1.07M • 672 • 21 takara-ai/MovieStills_Captioned_SmolVLM Viewer • Updated Feb 25, 2025 • 74.9k • 83
SwarmFormer Our collection of our frontier SwarmFormer architecture models. takara-ai/SwarmFormer-Sentiment-Small Updated Jun 21, 2025 • 9 • 5 takara-ai/SwarmFormer-Sentiment-Base Updated Jun 21, 2025 • 12 • 5
3D 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 15
3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes Paper • 2411.14974 • Published Nov 22, 2024 • 15
Medical Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 26
Teach Multimodal LLMs to Comprehend Electrocardiographic Images Paper • 2410.19008 • Published Oct 21, 2024 • 26
Synthetic Data Generation A collection of research papers, datasets and models focused on data generation by machines. proj-persona/PersonaHub Viewer • Updated Sep 26, 2025 • 375k • 13.5k • 689
LLM Performance Papers, tools and techniques on maximising LLM performance both in production and in training. facebook/multi-token-prediction Updated Jun 18, 2024 • 371 Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 33
Learning to (Learn at Test Time): RNNs with Expressive Hidden States Paper • 2407.04620 • Published Jul 5, 2024 • 33
Foundational Vision microsoft/Florence-2-large Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 748k • 1.73k
LLM Scaling LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 8, 2024 • 36 Associative Recurrent Memory Transformer Paper • 2407.04841 • Published Jul 5, 2024 • 35
LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages Paper • 2407.05975 • Published Jul 8, 2024 • 36
Model Security Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 12
Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks Paper • 2407.02855 • Published Jul 3, 2024 • 12
Small LLM’s HuggingFaceTB/SmolLM-135M-Instruct Text Generation • 0.1B • Updated Sep 4, 2024 • 12.2k • 129 HuggingFaceTB/SmolLM-360M-Instruct Text Generation • 0.4B • Updated Aug 18, 2024 • 4.85k • 83 HuggingFaceTB/SmolLM-1.7B-Instruct Text Generation • 2B • Updated Aug 18, 2024 • 3.44k • 118
Large LLM's meta-llama/Llama-3.1-405B-Instruct Text Generation • 406B • Updated Sep 25, 2024 • 148k • • 587
MultiModal Running on Zero Featured 816 Florence 2 📉 816 Generate captions and analyze images with various tasks
Running on Zero Featured 816 Florence 2 📉 816 Generate captions and analyze images with various tasks
Autonomous Agents The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127 CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper • 2408.06292 • Published Aug 12, 2024 • 127
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents Paper • 2407.01511 • Published Jul 1, 2024