Agenda — RAG Basics¶
Retrieval-Augmented Generation solves the most common failure mode of LLMs in production: hallucination about facts they weren't trained on. Today you build a working RAG system from scratch and understand every design decision.
Learning objectives¶
By the end of this session you will be able to:
- Explain the RAG architecture and why it reduces hallucination
- Implement five different chunking strategies and choose between them
- Build a complete retrieval + augmentation + generation pipeline
- Decide when RAG is better than fine-tuning (and when it isn't)
Schedule¶
| Time | Topic | File |
|---|---|---|
| 0:00 – 0:20 | What is RAG? — architecture, benefits, limitations | 01-what-is-rag |
| 0:20 – 0:55 | Chunking strategies — fixed, semantic, recursive, hierarchical | 02-chunking-strategies |
| 0:55 – 1:25 | Retrieval and augmentation — context assembly, prompt design | 03-retrieval-and-augmentation |
| 1:25 – 2:00 | RAG pipeline end-to-end — PDF ingestion to grounded answers | 04-rag-pipeline-end-to-end |
| 2:00 – 2:30 | RAG vs. fine-tuning — decision framework | 05-rag-vs-fine-tuning |
| 2:30 – 3:00 | Practice exercises | 06-practice-exercises |