Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
In a Training Loop š
38.7
TFLOPS
13
14
93
fahrizalfarid
akahana
Follow
agentlans's profile picture
piercus's profile picture
DualityAI-RebekahBogdanoff's profile picture
11 followers
Ā·
50 following
fahrizalfarid
fahrizalfarid
AI & ML interests
NLP
Recent Activity
reacted
to
SeaWolf-AI
's
post
with š„
about 7 hours ago
šļø Smol AI WorldCup: A 4B Model Just Beat 8B ā Here's the Data We evaluated 18 small language models from 12 makers on 125 questions across 7 languages. The results challenge the assumption that bigger is always better. Community Article: https://huggingface.co/blog/FINAL-Bench/smol-worldcup Live Leaderboard: https://huggingface.co/spaces/ginigen-ai/smol-worldcup Dataset: https://huggingface.co/datasets/ginigen-ai/smol-worldcup What we found: ā Gemma-3n-E4B (4B, 2GB RAM) outscores Qwen3-8B (8B, 5.5GB). Doubling parameters gained only 0.4 points. RAM cost: 2.75x more. ā GPT-OSS-20B fits in 1.5GB yet matches Champions-league dense models requiring 8.5GB. MoE architecture is the edge AI game-changer. ā Thinking models hurt structured output. DeepSeek-R1-7B scores 8.7 points below same-size Qwen3-8B and runs 2.7x slower. ā A 1.3B model fabricates confident fake content 80% of the time when prompted with nonexistent entities. Qwen3 family hits 100% trap detection across all sizes. ā Qwen3-1.7B (1.2GB) outscores Mistral-7B, Llama-3.1-8B, and DeepSeek-R1-14B. Latest architecture at 1.7B beats older architecture at 14B. What makes this benchmark different? Most benchmarks ask "how smart?" ā we measure five axes simultaneously: Size, Honesty, Intelligence, Fast, Thrift (SHIFT). Our ranking metric WCS = sqrt(SHIFT x PIR_norm) rewards models that are both high-quality AND efficient. Smart but massive? Low rank. Tiny but poor? Also low. Top 5 by WCS: 1. GPT-OSS-20B ā WCS 82.6 ā 1.5GB ā Raspberry Pi tier 2. Gemma-3n-E4B ā WCS 81.8 ā 2.0GB ā Smartphone tier 3. Llama-4-Scout ā WCS 79.3 ā 240 tok/s ā Fastest model 4. Qwen3-4B ā WCS 76.6 ā 2.8GB ā Smartphone tier 5. Qwen3-1.7B ā WCS 76.1 ā 1.2GB ā IoT tier Built in collaboration with the FINAL Bench research team. Interoperable with ALL Bench Leaderboard for full small-to-large model comparison. Dataset is open under Apache 2.0 (125 questions, 7 languages). We welcome new model submissions.
updated
a dataset
18 days ago
akahana/wikipedia-id-conv
published
a dataset
18 days ago
akahana/wikipedia-id-conv
View all activity
Organizations
None yet
akahana
's models
58
Sort:Ā Recently updated
akahana/indo-psikologi-sft
0.6B
ā¢
Updated
Jan 20
ā¢
2
akahana/driver-drowsiness
Updated
Jan 8
akahana/sl-cartpole-v1
Reinforcement Learning
ā¢
Updated
Jan 7
akahana/humanoid-rl-sac
Reinforcement Learning
ā¢
Updated
Jan 6
akahana/rocov2-enid-marianmt
Updated
Dec 18, 2025
akahana/pokemon-3b
4B
ā¢
Updated
Dec 18, 2025
ā¢
3
akahana/en-id-finetuned-4bit
Translation
ā¢
73.7M
ā¢
Updated
Dec 11, 2025
akahana/m2m100-nllb200-4bit
Updated
Dec 5, 2025
akahana/rag-contextual-indo-0.6b
0.6B
ā¢
Updated
Dec 5, 2025
ā¢
6
akahana/qwen3-4b-text-embedding-4bit
Feature Extraction
ā¢
4B
ā¢
Updated
Dec 4, 2025
ā¢
3
akahana/rag-contextual-indo-4b
Text Generation
ā¢
4B
ā¢
Updated
Dec 4, 2025
akahana/rag-contextual-indo-1b-v1
1B
ā¢
Updated
Dec 4, 2025
akahana/qwen3-next-80b
31B
ā¢
Updated
Nov 30, 2025
ā¢
8
akahana/rag-contextual-indo-270m
Text Generation
ā¢
0.3B
ā¢
Updated
Nov 29, 2025
ā¢
1
akahana/alpaca-indo-3b
3B
ā¢
Updated
Nov 25, 2025
akahana/rag-contextual-indo-8b
8B
ā¢
Updated
Nov 22, 2025
akahana/DeepSeek-R1-Distill-Llama-70B-GGUF
71B
ā¢
Updated
Nov 20, 2025
ā¢
19
akahana/indo-psikologi-1b
Text Generation
ā¢
1B
ā¢
Updated
Nov 20, 2025
akahana/indo-psikologi-8b
Text Generation
ā¢
8B
ā¢
Updated
Nov 13, 2025
ā¢
1
akahana/indo-psikologi-7b
Text Generation
ā¢
7B
ā¢
Updated
Nov 12, 2025
ā¢
1
akahana/qwen3-8b-embedding-4bit
8B
ā¢
Updated
Nov 9, 2025
ā¢
37
akahana/qwen3-0.6b-text-embedding
0.6B
ā¢
Updated
Nov 8, 2025
ā¢
914
akahana/qwen3-8b-text-embedding
8B
ā¢
Updated
Nov 8, 2025
ā¢
205
akahana/sahabatai-goto
8B
ā¢
Updated
Nov 6, 2025
ā¢
4
akahana/starstreak
7B
ā¢
Updated
Nov 3, 2025
ā¢
4
akahana/kesehatan-v0
7B
ā¢
Updated
Nov 1, 2025
ā¢
5
akahana/llm-models
3B
ā¢
Updated
Mar 13, 2025
ā¢
27
akahana/translation
Updated
Mar 8, 2025
akahana/dokter-chat-v0.1
Question Answering
ā¢
Updated
Jan 14, 2025
akahana/whisper-small-id
Automatic Speech Recognition
ā¢
0.2B
ā¢
Updated
Jan 2, 2025
Previous
1
2
Next