06. Module 05 Review — LLM Training Lifecycle¶
Focus: pretraining, SFT, RLHF / DPO, practical training knobs, and the bridge into quantization / fine-tuning efficiency.
Review loop¶
- Skim the TOC in
02_explainer.mdand mark the chapter that still feels least automatic. - Re-read only that fuzzy chapter.
- Re-answer the self-check questions in
01_weekly_plan.mdfrom memory. - Re-do the hardest prompts in
04_daily_recall.md. - Sketch the failure-fix table from explainer §6.1 — all 12 rows — without looking.
- Review
05_hands_on_lab.mdand write one note on data quality, one on training knobs, and one on evaluation.
Reflection¶
- Which stage of the lifecycle now feels operational instead of hand-wavy?
- Where are you still fuzzy: curriculum, shadowing, performance review, or specialization?
- What should feel automatic before starting
06_adaptation_compression?
Completion gate¶
- [ ] All explainer chapters read at least once
- [ ] Can explain the full lifecycle with the employee-training analogy
- [ ] Failure-fix table sketched from memory (all 12 rows)
- [ ] Weekly plan completed
- [ ] GPT-2 hands_on_lab shipped with before/after perplexity
- [ ] Can compute parameter memory from count × bytes
- [ ] Ready to move to
06_adaptation_compression