Need to read
Textbooks Are All You Need II: phi-1.5 technical report
Paper
• 2309.05463
• Published
• 89
When Less is More: Investigating Data Pruning for Pretraining LLMs at Scale
Paper
• 2309.04564
• Published
• 17
Large-Scale Automatic Audiobook Creation
Paper
• 2309.03926
• Published
• 56
The Languini Kitchen: Enabling Language Modelling Research at Different Scales of Compute
Paper
• 2309.11197
• Published
• 5
Chain-of-Verification Reduces Hallucination in Large Language Models
Paper
• 2309.11495
• Published
• 40
LMDX: Language Model-based Document Information Extraction and Localization
Paper
• 2309.10952
• Published
• 67
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper
• 2309.12307
• Published
• 90
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper
• 2309.12284
• Published
• 19
Small-scale proxies for large-scale Transformer training instabilities
Paper
• 2309.14322
• Published
• 22
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper
• 2309.14717
• Published
• 46
Jointly Training Large Autoregressive Multimodal Models
Paper
• 2309.15564
• Published
• 8