A lot has happened last month: Apple announced the integration of on-device LLMs, Nvidia shared their large Nemotron model, FlashAttention-3 was announced, Google's Gemma 2 came out, and much more. You've probably already read about it all in various news outlets.
ORIGINAL LINK: https://magazine.sebastianraschka.com/p/instruction-pretraining-llms