@aclifton314 yup, still the case that the gradients won’t flow through the sampling line. Check out this this post
chrisdoyleIE
43
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Finetuning GPT2 using Multiple GPU and Trainer | 14 | 6934 | May 22, 2023 | |
| Chapter 3 questions | 160 | 11806 | March 7, 2026 | |
| Fine tuning GPT2 tensorflow | 0 | 103 | June 24, 2024 | |
| Key Error 'loss' while fine tuning GPT-2 with the Trainer utility | 9 | 7544 | May 10, 2022 | |
| Fine tune with SFTTrainer | 17 | 16241 | September 12, 2024 |