LLMs-from-scratch/ch04
Sebastian Raschka b6c4b2f9f1
Update bonus section formatting (#400)
2024-10-12 10:26:08 -05:00
01_main-chapter-code note about random numbers 2024-09-22 12:02:03 -05:00
02_performance-analysis update card 2024-10-11 12:15:01 -05:00
README.md Update bonus section formatting (#400) 2024-10-12 10:26:08 -05:00

Chapter 4: Implementing a GPT Model from Scratch to Generate Text

 

Main Chapter Code

  • 01_main-chapter-code contains the main chapter code

Bonus Materials

  • 02_performance-analysis contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter
  • ch05/07_gpt_to_llama contains a step-by-step guide that converts a GPT architecture implementation to Llama 3.2 and loads pretrained weights from Meta AI (it may be interesting to look at alternative architectures after completing chapter 4, but you can also save that for after reading chapter 5)
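
For readers who want a quick sanity check before opening the analysis folder, the sketch below counts the trainable parameters of a GPT-style model from its configuration alone. It is not taken from the repository: the `CFG` values and the function name `count_gpt_params` are assumptions chosen to resemble a small GPT-2-style model. It also assumes learned positional embeddings, a 4x feed-forward expansion, and LayerNorm with scale and shift.

```python
# Hypothetical configuration (assumed values, not read from this repository).
CFG = {
    "vocab_size": 50257,     # tokenizer vocabulary size
    "context_length": 1024,  # maximum sequence length
    "emb_dim": 768,          # embedding / hidden dimension
    "n_layers": 12,          # number of transformer blocks
    "qkv_bias": False,       # whether query/key/value projections use a bias
}


def count_gpt_params(cfg, tie_weights=False):
    """Count parameters of a GPT-style decoder under the stated assumptions."""
    d = cfg["emb_dim"]

    tok_emb = cfg["vocab_size"] * d        # token embedding table
    pos_emb = cfg["context_length"] * d    # learned positional embeddings

    # Attention: three d x d projections (optional bias) plus an output projection with bias.
    qkv = 3 * (d * d + (d if cfg["qkv_bias"] else 0))
    out_proj = d * d + d

    # Feed-forward: d -> 4d -> d, both linear layers with bias.
    ff = (d * 4 * d + 4 * d) + (4 * d * d + d)

    # Two LayerNorms per block, each with a scale and a shift vector.
    norms = 2 * 2 * d

    block = qkv + out_proj + ff + norms
    final_norm = 2 * d

    # Output head without bias; it adds nothing if it shares the token embedding weights.
    out_head = 0 if tie_weights else cfg["vocab_size"] * d

    return tok_emb + pos_emb + cfg["n_layers"] * block + final_norm + out_head


total = count_gpt_params(CFG)
tied = count_gpt_params(CFG, tie_weights=True)
print(f"Total parameters: {total:,}")
print(f"With weight tying: {tied:,}")
print(f"Approx. float32 size: {total * 4 / 1024**2:.1f} MB")
```

The gap between the two counts is exactly one `vocab_size * emb_dim` matrix, which is why weight tying between the token embedding and the output head makes a noticeable difference for small models.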