Days 29–35 · 17.5 hours
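This week closes Phase II with a two-day capstone project (training a mini-LM, then writing an ablation report), then launches Phase III with the modern LLM training pipeline: pretraining, supervised fine-tuning, and alignment.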
| Day | Topic | Phase | Focus |
|---|---|---|---|
| 29 | Phase II Capstone Day 1 | II | Train mini-LM |
| 30 | Phase II Capstone Day 2 | II | Ablation report + checkpoint |
| 31 | The Modern LLM Recipe | III | Pretrain → SFT → Alignment |
| 32 | Supervised Fine-Tuning | III | Instruction-following |
| 33 | RLHF | III | Reward models + PPO |
| 34 | DPO & Modern Alignment | III | Direct preference optimization |
| 35 | LoRA & Efficient Fine-Tuning | III | Parameter-efficient methods |