Running RL 1 Price Negotiation Environment Server - Negotiate prices in an interactive buyer simulation
Running Featured 74 QED-Nano: Teaching a Tiny Model to Prove Hard Theorems - Who needs 1T parameters? Olympiad proofs with a 4B model
Sleeping RL 3 RLM Interactive Console - Query a recursive language model for long-context answers