-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 185 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
Collections
Discover the best community collections!
Collections including paper arxiv:2511.03276
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 119 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 78
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 15 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 36 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 139 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 55 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 40 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 48 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 125 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 67
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 133 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 99 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 211
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 185 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
-
General Agentic Memory Via Deep Research
Paper • 2511.18423 • Published • 161 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
SAM 3: Segment Anything with Concepts
Paper • 2511.16719 • Published • 125 -
Back to Basics: Let Denoising Generative Models Denoise
Paper • 2511.13720 • Published • 67
-
TiDAR: Think in Diffusion, Talk in Autoregression
Paper • 2511.08923 • Published • 119 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128 -
What Makes Diffusion Language Models Super Data Learners?
Paper • 2510.04071 • Published -
LLaDA2.0: Scaling Up Diffusion Language Models to 100B
Paper • 2512.15745 • Published • 78
-
TradingAgents: Multi-Agents LLM Financial Trading Framework
Paper • 2412.20138 • Published • 15 -
MinerU: An Open-Source Solution for Precise Document Content Extraction
Paper • 2409.18839 • Published • 36 -
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing
Paper • 2509.22186 • Published • 139 -
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
Paper • 2508.03680 • Published • 122
-
Fast-dLLM v2: Efficient Block-Diffusion LLM
Paper • 2509.26328 • Published • 55 -
Attention Is All You Need for KV Cache in Diffusion LLMs
Paper • 2510.14973 • Published • 40 -
Attention Sinks in Diffusion Language Models
Paper • 2510.15731 • Published • 48 -
Diffusion Language Models are Super Data Learners
Paper • 2511.03276 • Published • 128
-
A Survey of Context Engineering for Large Language Models
Paper • 2507.13334 • Published • 259 -
GUI-G^2: Gaussian Reward Modeling for GUI Grounding
Paper • 2507.15846 • Published • 133 -
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
Paper • 2507.22827 • Published • 99 -
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
Paper • 2508.18265 • Published • 211