paperdaily
updated
2.5 Years in Class: A Multimodal Textbook for Vision-Language
Pretraining
Paper
•
2501.00958
•
Published
•
109
Are Vision-Language Models Truly Understanding Multi-vision Sensor?
Paper
•
2412.20750
•
Published
•
19
Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs
Paper
•
2412.21187
•
Published
•
40
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs
Paper
•
2412.18925
•
Published
•
106
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via
Collective Monte Carlo Tree Search
Paper
•
2412.18319
•
Published
•
39
Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM
Reasoning via Autoregressive Search
Paper
•
2502.02508
•
Published
•
22
Process Reinforcement through Implicit Rewards
Paper
•
2502.01456
•
Published
•
61
PhD Knowledge Not Required: A Reasoning Challenge for Large Language
Models
Paper
•
2502.01584
•
Published
•
9
s1: Simple test-time scaling
Paper
•
2501.19393
•
Published
•
124
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
Paper
•
2501.19324
•
Published
•
39
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model
Post-training
Paper
•
2501.17161
•
Published
•
123
Open Problems in Mechanistic Interpretability
Paper
•
2501.16496
•
Published
•
20
Qwen2.5-1M Technical Report
Paper
•
2501.15383
•
Published
•
72
Baichuan-Omni-1.5 Technical Report
Paper
•
2501.15368
•
Published
•
60
Can We Generate Images with CoT? Let's Verify and Reinforce Image
Generation Step by Step
Paper
•
2501.13926
•
Published
•
43
Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament
Paper
•
2501.13007
•
Published
•
19
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding
Paper
•
2501.12380
•
Published
•
84