Motivation and Thoughts

Hi! I’m Yitao. This is a list of my notes while reading the book “Probabilistic Aritificial Intelligence” by Andreas Krause and Jonas Hübotter, which I’ll refer to as PAI. It is available on arxiv. I believe that it is also a course website, where the book is used as a textbook.

There are many approaches to learn reinforcement learning, and I believe that the book is friendly to people with background in machine learning. It is also a good chance to brush up memories from books like PRML.

Ultimately, our goal from reading the book is to know about concepts like Gaussian Process, Multi-arm Bandit and Q-learning, and how methods like markov chain and Neural Networks are used in solving the RL problem.

Another course that might be of interest is Reinforcement Learning and Optimal Control by Dimitri P. Bertsekas, which would provide more details about different problems in Reinforcement Learning. This course might also be more friendly to people with background in control theory.

My notetaking is mostly a cheatsheet for future me and I’ll go through some of the proofs in the book that is not entirely clear to me(and some of them are left in the book as exercise). A major part of the note is generated from back and forth conversation with LLMs while reading the book. Hopefully, my note can be helpful to you.

My notes

Plan: We would skip Bayesian Neural Network for now and come back later, and at the end, we might rely on few external sources to learn about topics like RLHF.