# Converting GPT to Llama
This folder contains code for converting the GPT implementation from chapters 4 and 5 to Meta AI's Llama architecture, in the following recommended reading order:
- converting-gpt-to-llama2.ipynb: contains code to convert GPT to Llama 2 7B step by step and to load the pretrained weights from Meta AI (one of the key architectural changes, RMSNorm, is sketched below)
- converting-llama2-to-llama3.ipynb: contains code to convert the Llama 2 model to Llama 3, Llama 3.1, and Llama 3.2 (the grouped-query attention change is sketched after this list)
- standalone-llama32.ipynb: a standalone notebook implementing Llama 3.2
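
One of the central changes when converting the GPT model to Llama 2 is replacing LayerNorm with RMSNorm, which rescales activations by their root mean square without re-centering them. The snippet below is a minimal PyTorch sketch of this idea, not the notebook's exact code; the `eps` value and parameter naming are illustrative.

```python
import torch
import torch.nn as nn


class RMSNorm(nn.Module):
    # Illustrative sketch of RMSNorm (eps and naming are assumptions).
    def __init__(self, emb_dim, eps=1e-5):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(emb_dim))

    def forward(self, x):
        # Normalize by the root mean square of the last dimension
        # (no mean subtraction and no bias, unlike LayerNorm).
        rms = torch.sqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return self.weight * (x / rms)
```

Compared to LayerNorm, RMSNorm drops the mean subtraction and the bias term, which saves a small amount of computation while working comparably well in practice.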
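
The step from Llama 2 to Llama 3 additionally swaps standard multi-head attention for grouped-query attention, where several query heads share one key/value head (hence the constraint `num_heads % num_kv_groups == 0`). The following is a simplified sketch under those assumptions; the class and argument names are illustrative, and details such as RoPE and a KV cache are omitted.

```python
import torch
import torch.nn as nn


class GroupedQueryAttention(nn.Module):
    # Illustrative sketch of grouped-query attention (not the notebook's exact code).
    def __init__(self, d_in, d_out, num_heads, num_kv_groups):
        super().__init__()
        assert d_out % num_heads == 0
        assert num_heads % num_kv_groups == 0  # each kv group serves several query heads
        self.num_heads = num_heads
        self.num_kv_groups = num_kv_groups
        self.group_size = num_heads // num_kv_groups
        self.head_dim = d_out // num_heads

        self.W_query = nn.Linear(d_in, d_out, bias=False)
        # Keys and values use fewer heads than the queries
        self.W_key = nn.Linear(d_in, num_kv_groups * self.head_dim, bias=False)
        self.W_value = nn.Linear(d_in, num_kv_groups * self.head_dim, bias=False)
        self.out_proj = nn.Linear(d_out, d_out, bias=False)

    def forward(self, x):
        b, seq_len, _ = x.shape
        q = self.W_query(x).view(b, seq_len, self.num_heads, self.head_dim).transpose(1, 2)
        k = self.W_key(x).view(b, seq_len, self.num_kv_groups, self.head_dim).transpose(1, 2)
        v = self.W_value(x).view(b, seq_len, self.num_kv_groups, self.head_dim).transpose(1, 2)

        # Repeat each key/value head so it is shared by group_size query heads
        k = k.repeat_interleave(self.group_size, dim=1)
        v = v.repeat_interleave(self.group_size, dim=1)

        # Causal scaled dot-product attention
        attn = (q @ k.transpose(2, 3)) / self.head_dim**0.5
        mask = torch.triu(
            torch.ones(seq_len, seq_len, dtype=torch.bool, device=x.device), diagonal=1
        )
        attn = attn.masked_fill(mask, float("-inf")).softmax(dim=-1)

        out = (attn @ v).transpose(1, 2).reshape(b, seq_len, -1)
        return self.out_proj(out)
```

Sharing key/value projections across groups of query heads shrinks the key/value weights and the memory needed for cached keys and values, which is why Llama 3 adopts grouped-query attention.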