LLMs-from-scratch/ch05/07_gpt_to_llama
casinca bb31de8999
Some checks failed
Code tests (Linux) / test (push) Has been cancelled
Code tests (macOS) / test (push) Has been cancelled
Test PyTorch 2.0 and 2.5 / test (2.0.1) (push) Has been cancelled
Test PyTorch 2.0 and 2.5 / test (2.5.0) (push) Has been cancelled
Code tests (Windows) / test (push) Has been cancelled
Check hyperlinks / test (push) Has been cancelled
Spell Check / spellcheck (push) Has been cancelled
PEP8 Style checks / flake8 (push) Has been cancelled
[minor] typo & comments (#441)
* typo & comment

- safe -> save
- commenting code: batch_size, seq_len = in_idx.shape

* comment

- adding # NEW for assert num_heads % num_kv_groups == 0

* update memory wording

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-11-18 19:52:42 +09:00
..
tests Update test-requirements-extra.txt 2024-10-23 19:19:58 -05:00
config.json move access token to config.json 2024-09-23 08:56:16 -05:00
converting-gpt-to-llama2.ipynb [minor] typo & comments (#441) 2024-11-18 19:52:42 +09:00
converting-llama2-to-llama3.ipynb [minor] typo & comments (#441) 2024-11-18 19:52:42 +09:00
previous_chapters.py GPT to Llama (#368) 2024-09-23 07:34:06 -05:00
README.md Implement Llama 3.2 (#383) 2024-10-05 07:30:47 -05:00
requirements-extra.txt fixed Llama 2 to 3.2 NBs (#388) 2024-10-06 09:56:55 -05:00
standalone-llama32.ipynb [minor] typo & comments (#441) 2024-11-18 19:52:42 +09:00

Converting GPT to Llama

This folder contains code for converting the GPT implementation from chapter 4 and 5 to Meta AI's Llama architecture in the following recommended reading order: