CS 185/285 at UC Berkeley

Deep Reinforcement Learning

Lectures: 9 - 10 am on Wednesdays and 8 - 10 am on Fridays, both in Hearst Annex A1

Announcement: Homework 5 (Offline RL) is now released.

Announcement: The default final project options are now available: Offline-to-Online RL Default Final Project and LLM RL Default Final Project.

Announcement: Homework 4 (LLM RL) is now released.

Announcement: The final project outline has been released.

Looking for deep RL course materials from past years?

Recordings of lectures from Fall 2023 are here, and materials from previous offerings are here.

Email all staff (preferred): cs285-staff-sp2026@lists.eecs.berkeley.edu

Instructor Sergey Levine

svlevine@eecs.berkeley.edu

Office Hours: Wednesdays 8 - 9 AM in Hearst Annex A1
Head GSI Seohong Park

seohong@berkeley.edu

Office Hours: Fri 1:15p-2:15p in Berkeley Way West 1204
GSI Vivek Myers

vmyers@berkeley.edu

Office Hours: Tue 4p-5p in Berkeley Way West 1204
GSI Kevin Black

kvablack@berkeley.edu

Office Hours: Tue 9a-10a in Berkeley Way West 1212
GSI Pranav Atreya

pranavatreya@berkeley.edu

Office Hours: Mon 5p-6p in Berkeley Way West 1216
GSI Mitsuhiko Nakamoto

nakamoto@eecs.berkeley.edu

Office Hours: Thursday 4p-5p Berkeley Way West 1204
GSI Catherine Glossop

catherine_glossop@berkeley.edu

Office Hours: Thu 10a-11a in Berkeley Way West 1211

Week 1 Overview

Course Intro & Imitation Learning

Monday, January 19 – Friday, January 23

Homework 1: Imitation Learning

Week 2 Overview

Imitation Learning & RL Basics

Monday, January 26 – Friday, January 30

Homework 1: Imitation Learning

Week 3 Overview

Policy Gradients & Actor Critic

Monday, February 2 – Friday, February 6

Week 4 Overview

Value-Based RL

Monday, February 9 – Friday, February 13

Week 5 Overview

Advanced Policy Gradients

Monday, February 16 – Friday, February 20

Week 6 Overview

Variational Inference

Monday, February 23 – Friday, February 27

Week 7 Overview

Finishing VI & LLM RL

Monday, March 2 – Friday, March 6

Week 8 Overview

Model-Based RL

Monday, March 9 – Friday, March 13

Week 9 Overview

Offline Reinforcement Learning

Monday, March 16 – Friday, March 20

Final Project Information

Default Project Options

Project Outline

Final Project Outline

Homeworks

See Syllabus for more information (including rough schedule).

Lecture Slides

See Syllabus for more information.

Lecture 1: Introduction
Lecture 2: Behavioral Cloning
Lecture 3: Behavioral Cloning Part 2
Lecture 4: RL Basics
Lecture 5: Policy Gradients
Lecture 6: Actor Critic
Lecture 7: Value-Based RL
Lecture 8: Q-learning in Practice
Lecture 9: Advanced Policy Gradients Part 1
Lecture 10: Advanced Policy Gradients Part 2
Lecture 11: Variational Inference
Lecture 12: VI in RL
Lecture 13: Control as Inference
Lecture 14: LLM RL
Lecture 15: Model-Based RL Part 1
Lecture 16: Model-Based RL Part 2
Lecture 17: Offline RL Part 1
Lecture 18: Offline RL Part 2
Lecture 19: TBD
Lecture 20: TBD
Lecture 21: TBD
Lecture 22: TBD
Lecture 23: TBD

Discussion Section Slides