When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought • Paper • 2511.02779 • Published Nov 4, 2025
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents • Paper • 2601.16344 • Published 12 days ago
UAlign: Pushing the Limit of Template-free Retrosynthesis Prediction with Unsupervised SMILES Alignment • Paper • 2404.00044 • Published Mar 25, 2024
EvoLM: In Search of Lost Language Model Training Dynamics • Paper • 2506.16029 • Published Jun 19, 2025