Skip to main content Link Menu Expand (external link) Copy Copied

Advanced Natural Language Processing

Spring 2026

Instructor: Sewon Min, Alane Suhr
Class hours: TuThu 15:30–17:00 (15:40–17:00 considering Berkeley time)
Class location: SODA 306
Contact: TBA
Ed link: TBA

If you are interested in taking the course and can’t directly enroll, please submit this form.


This course provides a graduate-level introduction to Natural Language Processing (NLP). We will survey foundational approaches such as word representations and n-gram language models, followed by neural methods including recurrent networks and attention mechanisms, and then progress to modern Transformer-based architectures. In addition, the course will cover advanced topics in contemporary NLP such as retrieval-augmented models, mixture-of-experts architectures, AI agents, and vision-language models.

Prerequisites: CS 288 assumes prior experience in machine learning and proficiency in PyTorch. Students should be familiar with neural networks, PyTorch, and NumPy; no introductory tutorials will be provided. Relevant experience can be gained in CS 182, 188, or 189. Prior coursework in linguistics or NLP (e.g., CS 183/283A) is recommended but not required.

Schedule (Tentative)

All deadlines are by 6PM PST.

01/20 Tue
Introduction
01/22 Thu
Word representation & Text classification
01/27 Tue
Language modeling basics
Assignment 1 released
01/29 Thu
Recurrent models & Attention
02/03 Tue
Pre-LLM case study: Text generation
02/05 Thu
Pre-LLM case study: Question answering
02/10 Tue
LLM basics: Architecture (Tokenizer, Transformers, Encoders, Decoders)
Assignment 1 due
02/12 Thu
LLM basics: Training (Pre-training, Fine-tuning)
Assignment 2 released
02/17 Tue
LLM basics: GPT-3, Prompting, In-context learning
02/19 Thu
LLM deep dive: Scaling laws
02/24 Tue
LLM deep dive: Data curation
02/26 Thu
LLM deep dive: Post-training
Assignment 2 due
03/03 Tue
LLM deep dive: Inference methods & Evaluation
03/05 Thu
Experimental design & Human annotation
Assignment 3 released
03/10 Tue
Retrieval and RAG
03/12 Thu
Long-context LLM
03/17 Tue
Mixture-of-Experts
03/19 Thu
Test-time compute & Reasoning models
Assignment 3 due
03/24 Tue
No class: Spring break
03/26 Thu
No class: Spring break
03/31 Tue
LLM agents
Assignment 4 released
04/02 Thu
Vision-language models
04/07 Tue
Speech models
04/09 Thu
Interactive embodied agents
04/14 Tue
Pragmatics
Assignment 4 due
04/16 Thu
Impact & Social implications
04/21 Tue
Guest lecture (TBA)
04/23 Thu
Guest lecture (TBA)
04/28 Tue
Project presentation
04/30 Thu
Project presentation
Project report due by 05/05 (Tue)