Cornell-AGI

university

AI & ML interests

Reinforcement Learning from Human Feedback

Recent Activity

GitBag submitted a paper about 2 months ago

p1: Better Prompt Optimization with Fewer Prompts

GitBag authored a paper 8 months ago

Prompt Curriculum Learning for Efficient LLM Post-Training

GitBag authored a paper 12 months ago

Pre-trained Large Language Models Learn Hidden Markov Models In-context

View all activity

Collections 3

View 3 collections

models 20

Cornell-AGI/apo_math_qwen2.5_1.5b

Text Generation • 2B • Updated May 5, 2025 • 2

Cornell-AGI/ppo_math_qwen2.5_1.5b

Text Generation • 2B • Updated May 5, 2025 • 1

Cornell-AGI/rebel_math_qwen2.5_1.5b

Text Generation • 2B • Updated May 5, 2025 • 2

Cornell-AGI/grpo_math_qwen2.5_3b

Text Generation • 3B • Updated May 5, 2025 • 1

Cornell-AGI/grpo_math_qwen2.5_1.5b

Text Generation • 2B • Updated May 5, 2025 • 3

Cornell-AGI/ppo_math_qwen2.5_3b

Text Generation • 3B • Updated May 5, 2025 • 3

Cornell-AGI/rebel_math_qwen2.5_3b

Text Generation • 3B • Updated May 5, 2025 • 4

Cornell-AGI/apo_math_qwen2.5_3b

Text Generation • 3B • Updated May 5, 2025 • 4

Cornell-AGI/grpo_math_qwen2.5_7b

Text Generation • 8B • Updated May 5, 2025 • 4

Cornell-AGI/ppo_math_qwen2.5_7b

Text Generation • 8B • Updated May 5, 2025 • 3

datasets 15

Cornell-AGI/math_size_qwen2.5_7b_eval

Viewer • Updated May 29, 2025 • 7.5k • 10

Cornell-AGI/math_size_qwen2.5_3b_eval

Viewer • Updated May 29, 2025 • 7.5k • 93

Cornell-AGI/math_size_qwen2.5_1.5b_eval

Viewer • Updated May 29, 2025 • 7.5k • 18

Cornell-AGI/gsm8k_size_qwen2.5_7b_eval

Viewer • Updated May 29, 2025 • 7.47k • 12

Cornell-AGI/gsm8k_size_qwen2.5_3b_eval

Viewer • Updated May 29, 2025 • 7.47k • 73

Cornell-AGI/gsm8k_size_qwen2.5_1.5b_eval

Viewer • Updated May 29, 2025 • 7.47k • 89

Cornell-AGI/amazon_movie_tv_item_mxbai

Viewer • Updated Dec 2, 2024 • 10.5k • 12

Cornell-AGI/amazon_movie_tv_llama_mxbai

Viewer • Updated Oct 23, 2024 • 17.1k • 236

Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_2

Viewer • Updated Oct 8, 2024 • 116k • 198 • 1

Cornell-AGI/REFUEL-Ultrainteract-Llama-3-Armo-iter_1

Viewer • Updated Oct 8, 2024 • 64.6k • 34 • 2

View 15 datasets