Scaling Laws for Neural Language Models (January 23, 2020)
Deep Double Descent: Where Bigger Models and More Data Hurt (December 5, 2019)
Training Compute-Optimal Large Language Models (April 13, 2022)
Emergent Abilities of Large Language Models (August 2022)