LLMs-from-scratch/ch02
Sebastian Raschka 1f61aeb7c4
Some checks are pending
Code tests (Linux) / test (push) Waiting to run
Code tests (macOS) / test (push) Waiting to run
Test PyTorch 2.0 and 2.4 / test (2.0.1) (push) Waiting to run
Test PyTorch 2.0 and 2.4 / test (2.4.0) (push) Waiting to run
Code tests (Windows) / test (push) Waiting to run
Check hyperlinks / test (push) Waiting to run
Spell Check / spellcheck (push) Waiting to run
PEP8 Style checks / flake8 (push) Waiting to run
Note about SSL certificates (#404)
2024-10-19 16:27:19 -05:00
..
01_main-chapter-code Note about SSL certificates (#404) 2024-10-19 16:27:19 -05:00
02_bonus_bytepair-encoder update formatting 2024-05-24 07:20:37 -05:00
03_bonus_embedding-vs-matmul minor spelling fix 2024-09-08 15:35:36 -05:00
04_bonus_dataloader-intuition fixed num_workers (#229) 2024-06-19 17:36:46 -05:00
README.md Update bonus section formatting (#400) 2024-10-12 10:26:08 -05:00

Chapter 2: Working with Text Data

 

Main Chapter Code

 

Bonus Materials

  • 02_bonus_bytepair-encoder contains optional code to benchmark different byte pair encoder implementations

  • 03_bonus_embedding-vs-matmul contains optional (bonus) code to explain that embedding layers and fully connected layers applied to one-hot encoded vectors are equivalent.

  • 04_bonus_dataloader-intuition contains optional (bonus) code to explain the data loader more intuitively with simple numbers rather than text.