adamsecada 's Collections Favorites
updated
Bootstrapping Language Models with DPO Implicit Rewards
Paper
• 2406.09760
• Published
• 41
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code
Intelligence
Paper
• 2406.11931
• Published
• 69
Prism: A Framework for Decoupling and Assessing the Capabilities of VLMs
Paper
• 2406.14544
• Published
• 35
Instruction Pre-Training: Language Models are Supervised Multitask
Learners
Paper
• 2406.14491
• Published
• 96
Mixture-of-Agents Enhances Large Language Model Capabilities
Paper
• 2406.04692
• Published
• 59
CRAG -- Comprehensive RAG Benchmark
Paper
• 2406.04744
• Published
• 46
Hierarchical Prompting Taxonomy: A Universal Evaluation Framework for
Large Language Models
Paper
• 2406.12644
• Published
• 5
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs
with Nothing
Paper
• 2406.08464
• Published
• 71
AdvPrompter: Fast Adaptive Adversarial Prompting for LLMs
Paper
• 2404.16873
• Published
• 29
LLM Agents can Autonomously Hack Websites
Paper
• 2402.06664
• Published
• 3
Negotiating with LLMS: Prompt Hacks, Skill Gaps, and Reasoning Deficits
Paper
• 2312.03720
• Published
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of
LLMs through a Global Scale Prompt Hacking Competition
Paper
• 2311.16119
• Published
• 2
On the Exploitability of Instruction Tuning
Paper
• 2306.17194
• Published
• 9
Teams of LLM Agents can Exploit Zero-Day Vulnerabilities
Paper
• 2406.01637
• Published
• 2
Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems
Paper
• 2407.01370
• Published
• 89
Imagine yourself: Tuning-Free Personalized Image Generation
Paper
• 2409.13346
• Published
• 69
Training Language Models to Self-Correct via Reinforcement Learning
Paper
• 2409.12917
• Published
• 140
LLMs + Persona-Plug = Personalized LLMs
Paper
• 2409.11901
• Published
• 35
Seed-Music: A Unified Framework for High Quality and Controlled Music
Generation
Paper
• 2409.09214
• Published
• 53
Tutor CoPilot: A Human-AI Approach for Scaling Real-Time Expertise
Paper
• 2410.03017
• Published
• 29
Unbounded: A Generative Infinite Game of Character Life Simulation
Paper
• 2410.18975
• Published
• 37
A Survey of Small Language Models
Paper
• 2410.20011
• Published
• 46
SocialGPT: Prompting LLMs for Social Relation Reasoning via Greedy
Segment Optimization
Paper
• 2410.21411
• Published
• 19
The Danger of Overthinking: Examining the Reasoning-Action Dilemma in
Agentic Tasks
Paper
• 2502.08235
• Published
• 59
MAPS: A Multi-Agent Framework Based on Big Seven Personality and
Socratic Guidance for Multimodal Scientific Problem Solving
Paper
• 2503.16905
• Published
• 54
Efficient Agents: Building Effective Agents While Reducing Cost
Paper
• 2508.02694
• Published
• 86
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning
in LLMs
Paper
• 2506.15211
• Published
• 39
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent
Distillation and Agentic RL
Paper
• 2508.13167
• Published
• 129
Prompt Orchestration Markup Language
Paper
• 2508.13948
• Published
• 48
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic
Data and Scalable Reinforcement Learning
Paper
• 2509.13305
• Published
• 91
Less is More: Recursive Reasoning with Tiny Networks
Paper
• 2510.04871
• Published
• 509
Agent Learning via Early Experience
Paper
• 2510.08558
• Published
• 273