UCoder: Unsupervised Code Generation by Internal Probing of Large Language Models Paper • 2512.17385 • Published 13 days ago • 17
Multi-Agent Collaboration for Multilingual Code Instruction Tuning Paper • 2502.07487 • Published Feb 11, 2025
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models Paper • 2502.13059 • Published Feb 18, 2025
CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models Paper • 2502.16614 • Published Feb 23, 2025 • 27
KORGym: A Dynamic Game Platform for LLM Reasoning Evaluation Paper • 2505.14552 • Published May 20, 2025 • 1
M3TQA: Massively Multilingual Multitask Table Question Answering Paper • 2508.16265 • Published Aug 22, 2025
From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence Paper • 2511.18538 • Published Nov 23, 2025 • 279
V-GameGym: Visual Game Generation for Code Large Language Models Paper • 2509.20136 • Published Sep 24, 2025 • 9