Mirror of https://github.com/rasbt/LLMs-from-scratch.git (synced 2024-11-25 16:22:50 +08:00)
Chapter 4: Implementing a GPT Model from Scratch to Generate Text
Main Chapter Code
- 01_main-chapter-code contains the main chapter code.
Bonus Materials
- 02_performance-analysis contains optional code analyzing the performance of the GPT model(s) implemented in the main chapter.
- ch05/07_gpt_to_llama contains a step-by-step guide for converting a GPT architecture implementation to Llama 3.2 and loading pretrained weights from Meta AI. (It may be interesting to look at alternative architectures after completing chapter 4, but you can also save that until after reading chapter 5.)