Phase I · Days 1–7 · 17.5 hours
This week builds the foundation everything else rests on. You'll revisit backpropagation through computation graphs, understand why CNNs and RNNs were the dominant paradigms (and their limitations), and discover the information-theoretic thread — compression = prediction = intelligence — that unifies the entire curriculum.
| Day | Topic | Focus |
|---|---|---|
| 1 | Computation Graphs & Backprop | How gradients actually flow |
| 2 | CNN & ResNets | Spatial hierarchies & residual revolution |
| 3 | RNN/LSTM Essentials | Sequential processing & vanishing gradients |
| 4 | Seq2Seq & The Bottleneck | The fixed-vector bottleneck that demands attention |
| 5 | Information Theory & Compression | Cross-entropy, KL divergence, compression = intelligence |
| 6 | Embeddings & Representation Learning | How neural nets learn meaning |
| 7 | Training Stability Cookbook | Practical recipes for stable training |
For detailed chapter-level coverage of all Week 1 topics, see: 01 — DL Foundations & Information Theory