Lecture Materials
Slides and resources for each lecture
Week 1: Introduction (Jan 12, 14)
Week 2: When to Annotate (Jan 21)
Lecture 3: When to Annotate | Tools & Formats
Rule-based vs. ML approaches, decision framework, annotation tool landscape, data formats
PDFWeek 3: Corpus & Data (Jan 28)
Lecture 4: Corpus Selection & Data Sourcing
MAMA criteria, sampling strategies, licensing, synthetic data generation
PDFWeek 4: What Models Learn (Feb 2, 4)
Lectures 5 & 6: What Models Learn from Annotation
How annotations shape model behavior, data-centric AI, annotation artifacts
PDFWeek 5: Design Pipeline & IAA I (Feb 9, 11)
Week 7: IAA II & IAA III (Feb 23, 25)
Week 8: IAA in the LLM Era & Annotator Reliability (Mar 2, 4)
Week 10: Annotator Reliability II & Annotation Projects (Mar 16, 18)
Week 11: Annotation Projects & Supervision Engineering (Mar 23, 25)
Week 12: Instruction Annotation (Mar 30)
Lecture 17: Instruction Annotation
Instruction-tuning datasets, task contract specification, template leakage, mixture engineering
PDF LLMWeek 13: Instruction Annotation & RLHF (Apr 13, 15)
Week 14: Preference Annotation & Reasoning Tuning (Apr 20, 22)
Lecture 20: Preference Annotation & RLHF in Practice
Preferences as value judgments, RLHF pipelines, DPO, Constitutional AI, rater pool composition, biases
PDF LLMLecture 21: From RLHF to Reasoning Tuning
Why the field pivoted: credit assignment, chain-of-thought, ORM vs.\ PRM, GRPO, train-time and test-time scaling
PDF LLMWeek 15: Reasoning Annotation & The Future (Apr 27, 29)
Lecture 22: Reasoning Annotation: Process Supervision
Rationales vs.\ process supervision, PRM800K, Math-Shepherd, e-SNLI leakage, faithfulness, CoT controllability
PDF LLMLecture 23: The Future of Annotation: From Text to Agents
Beyond text: agent trajectories, tool-use traces, multi-modal annotation, the road ahead
Upcoming