Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment
Paper
•
2402.19085
•
Published
None defined yet.
GISA: A Benchmark for General Information-Seeking Assistant
DLLM-Searcher: Adapting Diffusion Large Language Model for Search Agents