RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services Paper • 2511.07070 • Published Nov 10, 2025
LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling Paper • 2510.06915 • Published Oct 8, 2025
Revisiting Long-context Modeling from Context Denoising Perspective Paper • 2510.05862 • Published Oct 7, 2025
LOGO -- Long cOntext aliGnment via efficient preference Optimization Paper • 2410.18533 • Published Oct 24, 2024