Sparkleholic 's Collections 202505
updated
Survey on Evaluation of LLM-based Agents
Paper
• 2503.16416
• Published
• 96
Perception, Reason, Think, and Plan: A Survey on Large Multimodal
Reasoning Models
Paper
• 2505.04921
• Published
• 186
Survey of User Interface Design and Interaction Techniques in Generative
AI Applications
Paper
• 2410.22370
• Published
• 12
Survey of Hallucination in Natural Language Generation
Paper
• 2202.03629
• Published
Evaluating Large Language Models: A Comprehensive Survey
Paper
• 2310.19736
• Published
• 2
Large Language Model Alignment: A Survey
Paper
• 2309.15025
• Published
• 2
A Survey on Multimodal Large Language Models
Paper
• 2306.13549
• Published
• 1
Natural Language Reasoning, A Survey
Paper
• 2303.14725
• Published
• 2
ToolSandbox: A Stateful, Conversational, Interactive Evaluation
Benchmark for LLM Tool Use Capabilities
Paper
• 2408.04682
• Published
• 18