
Ge Zhang

zhangysk

Recent Activity

upvoted a paper 17 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published 18 days ago • 43

submitted a paper to Daily Papers 17 days ago

NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents

Paper • 2512.12730 • Published 18 days ago • 43

upvoted a paper about 1 month ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 54

authored a paper about 1 month ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 279

upvoted a paper about 1 month ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 279

authored 5 papers about 1 month ago

MME-CC: A Challenging Multi-Modal Evaluation Benchmark of Cognitive Capacity

Paper • 2511.03146 • Published Nov 5, 2025 • 7

RLoop: An Self-Improving Framework for Reinforcement Learning with Iterative Policy Initialization

Paper • 2511.04285 • Published Nov 6, 2025 • 7

upvoted 2 papers about 2 months ago

Virtual Width Networks

Paper • 2511.11238 • Published Nov 14, 2025 • 37

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

liked a dataset about 2 months ago

m-a-p/LPFQA

Viewer • Updated Nov 10, 2025 • 502 • 124 • 5

upvoted a paper about 2 months ago

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published Nov 10, 2025 • 17

upvoted a collection 2 months ago

Ouro

Collection

a family of pre-trained Looped Language Models. • 4 items • Updated Oct 29, 2025 • 21

liked a model 2 months ago

ByteDance/Ouro-1.4B

Text Generation • Updated Nov 16, 2025 • 14.2k • 57

authored 4 papers 2 months ago

IFEvalCode: Controlled Code Generation

Paper • 2507.22462 • Published Jul 30, 2025

VideoScore2: Think before You Score in Generative Video Evaluation

Paper • 2509.22799 • Published Sep 26, 2025 • 25

Towards Personalized Deep Research: Benchmarks and Evaluations

Paper • 2509.25106 • Published Sep 29, 2025 • 29

Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution

Paper • 2509.25301 • Published Sep 29, 2025 • 19
