On Vacation 🏝️

Jerry Pan

JERRYPAN617

https://jerrypan617.github.io/

jerrypan617

AI & ML interests

RLHF, Retrieval-Augmented Multimodal Understanding...

Recent Activity

upvoted a paper about 1 month ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

liked a dataset 3 months ago

PKU-Alignment/PKU-SafeRLHF-single-dimension

liked a dataset 3 months ago

PKU-Alignment/PKU-SafeRLHF

Organizations

liked 5 datasets 3 months ago

liked 2 models 3 months ago

Qwen/Qwen2.5-1.5B-Instruct

Text Generation • 2B • Updated Sep 25, 2024 • 7.01M • 617

JERRYPAN617/HH-BTRewardModel-roberta

Reinforcement Learning • 0.1B • Updated Nov 13, 2025 • 1 • 1

liked 7 datasets 3 months ago

ys-zong/VLGuard

Viewer • Updated Jan 19, 2025 • 3k • 341 • 13

PKU-Alignment/MM-SafetyBench

Viewer • Updated Sep 19, 2024 • 6.72k • 938 • 4

saferlhf-v/BeaverTails-V

Viewer • Updated Mar 8, 2025 • 30.4k • 440 • 7

PKU-Alignment/PKU-SafeRLHF-V

Viewer • Updated Mar 25, 2025 • 30.4k • 162 • 5

Moemu/Muice-Dataset

Viewer • Updated 13 days ago • 3.74k • 167 • 49

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3, 2024 • 3.18k • 572

MMMU/MMMU

Viewer • Updated 7 days ago • 11.6k • 72k • 317

liked a Space 3 months ago

Qwen2.5 Psydoctor Demo

A LoRA adapter fine-tuned from the Qwen2.5-1.5B-Instruct model, built specifically for psychologist-style conversation scenarios.

liked 2 datasets 4 months ago

FreedomIntelligence/medical-o1-reasoning-SFT

Viewer • Updated Apr 22, 2025 • 90.1k • 3.65k • 1.07k

nvidia/Nemotron-CC-Math-v1

Viewer • Updated Dec 23, 2025 • 190M • 3.83k • 66

liked a model 4 months ago

JERRYPAN617/qwen2.5-lora-psydoctor

Text Generation • Updated Oct 25, 2025 • 6 • 1

liked 2 datasets 4 months ago

hiyouga/geometry3k

Viewer • Updated Apr 14, 2025 • 3k • 23.8k • 68

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 18.5k • 1.66k
