Reasoning generalization and reasoning in large language models papers Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 46
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 46
Training Novel architectures and techniques to train neural nets Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer Paper • 2203.03466 • Published Mar 7, 2022 • 1
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer Paper • 2203.03466 • Published Mar 7, 2022 • 1
DeepMind papers collection of papers from google deepmind Attention Is All You Need Paper • 1706.03762 • Published Jun 12, 2017 • 110
Multimodal agents ReALM: Reference Resolution As Language Modeling Paper • 2403.20329 • Published Mar 29, 2024 • 22
Reasoning generalization and reasoning in large language models papers Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 46
Advancing LLM Reasoning Generalists with Preference Trees Paper • 2404.02078 • Published Apr 2, 2024 • 46
DeepMind papers collection of papers from google deepmind Attention Is All You Need Paper • 1706.03762 • Published Jun 12, 2017 • 110
Training Novel architectures and techniques to train neural nets Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer Paper • 2203.03466 • Published Mar 7, 2022 • 1
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer Paper • 2203.03466 • Published Mar 7, 2022 • 1
Multimodal agents ReALM: Reference Resolution As Language Modeling Paper • 2403.20329 • Published Mar 29, 2024 • 22