takara.ai

company

Verified

https://takara.ai

takara-ai

AI & ML interests

GenAI, diffusion, LLMs, and state-of-the-art solutions.

Recent Activity

takarajordan updated a dataset about 1 month ago

takara-ai/poker_hands

takarajordan published a dataset about 1 month ago

takara-ai/poker_hands

takarajordan updated a dataset about 2 months ago

takara-ai/FRED-CONVERTED

takara-ai's collections (15)

Image Datasets

Our frontier vision data!

takara-ai/image_captions

Viewer • Updated Feb 11, 2025 • 1.07M • 672 • 21
takara-ai/MovieStills_Captioned_SmolVLM

Viewer • Updated Feb 25, 2025 • 74.9k • 83

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes

Paper • 2411.14974 • Published Nov 22, 2024 • 15

Synthetic Data Generation

A collection of research papers, datasets and models focused on data generation by machines.

proj-persona/PersonaHub

Viewer • Updated Sep 26, 2025 • 375k • 13.5k • 689

Foundational Vision

microsoft/Florence-2-large

Image-Text-to-Text • 0.8B • Updated Aug 4, 2025 • 748k • 1.73k

Model Security

Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks

Paper • 2407.02855 • Published Jul 3, 2024 • 12

Small LLMs

HuggingFaceTB/SmolLM-135M-Instruct

Text Generation • 0.1B • Updated Sep 4, 2024 • 12.2k • 129
HuggingFaceTB/SmolLM-360M-Instruct

Text Generation • 0.4B • Updated Aug 18, 2024 • 4.85k • 83
HuggingFaceTB/SmolLM-1.7B-Instruct

Text Generation • 2B • Updated Aug 18, 2024 • 3.44k • 118

MultiModal

Florence 2

Running on Zero • Featured • 816

Generate captions and analyze images with various tasks

Audio

MuCodec: Ultra Low-Bitrate Music Codec

Paper • 2409.13216 • Published Sep 20, 2024 • 22

SwarmFormer

Our collection of frontier SwarmFormer architecture models.

takara-ai/SwarmFormer-Sentiment-Small

Updated Jun 21, 2025 • 9 • 5
takara-ai/SwarmFormer-Sentiment-Base

Updated Jun 21, 2025 • 12 • 5

Medical

Teach Multimodal LLMs to Comprehend Electrocardiographic Images

Paper • 2410.19008 • Published Oct 21, 2024 • 26

LLM Performance

Papers, tools and techniques on maximising LLM performance both in production and in training.

facebook/multi-token-prediction

Updated Jun 18, 2024 • 371
Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Paper • 2407.04620 • Published Jul 5, 2024 • 33

LLM Scaling

LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages

Paper • 2407.05975 • Published Jul 8, 2024 • 36
Associative Recurrent Memory Transformer

Paper • 2407.04841 • Published Jul 5, 2024 • 35

VLM Performance

Vision language models are blind

Paper • 2407.06581 • Published Jul 9, 2024 • 84

Large LLMs

meta-llama/Llama-3.1-405B-Instruct

Text Generation • 406B • Updated Sep 25, 2024 • 148k • 587

Autonomous Agents

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery

Paper • 2408.06292 • Published Aug 12, 2024 • 127
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

Paper • 2407.01511 • Published Jul 1, 2024