AI Summary Series
After the modest success of the CS336 Series, I decided to expand this idea to other courses that are either popular with a wider audience or personally interesting to me. The motivation remains the same: to explore whether we can learn a course faster and (hopefully) better with the help of LLMs.
At the moment, the following courses are covered:
- CS336 Series: Language Modeling from Scratch by Prof. Percy Liang and the Stanford CS336 team.
- Stanford CS236 - Deep Generative Models: Deep Generative Models by Prof. Stefano Ermon and the team.
- MIT 6.S184 - Flow Matching and Diffusion Models: Flow Matching and Diffusion Models by Peter Holderrieth, Ezra Erives, and the MIT team.
- Karpathy’s Series: The famous series by Andrej Karpathy on building LLMs from scratch.
- Agentic AI Series: A collection of lectures and tutorials on building agentic AI systems (unfortunately, I couldn’t find the original source of this series).
Disclaimers:
- All credits go to the original authors and instructors.
- This should not be seen as a substitute for the official materials. Watching the lectures directly will always give you the best learning experience.
- The actual content is produced by LLMs with very little human intervention. While I believe that today’s flagship reasoning LLMs can often summarize better than the average human, they are still not perfect. There may be mistakes, hallucinations, or missing insights—especially since I intentionally avoid adding too much of my own interpretation. So please keep these limitations in mind and take the content with a grain of salt.
CS336 - Language Modeling from Scratch
- CS336 Lecture 1: Overview and Tokenization
- CS336 Lecture 2: PyTorch, resource accounting
- CS336 Lecture 3: Architectures, hyperparameters
- CS336 Lecture 4: Mixture of experts
- CS336 Lecture 5: GPUs
- CS336 Lecture 6: Kernels, Triton
- CS336 Lecture 7: Parallelism
- CS336 Lecture 8: Parallelism - Part 2
- CS336 Lecture 9: Scaling laws
- CS336 Lecture 10: Inference
- CS336 Lecture 11: Scaling laws - Part 2
- CS336 Lecture 12: Evaluation
- CS336 Lecture 13: Data
- CS336 Lecture 14: Data - Part 2
- CS336 Lecture 15: Alignment - SFT/RLHF
- CS336 Lecture 16: Alignment - RL
- CS336 Lecture 17: Alignment - RL - Part 2
Karpathy Zero to Hero
- Karpathy Series - Building Micrograd
- Karpathy Series - Building Makemore
- Karpathy Series - Building Makemore Part 2 - MLP
- Karpathy Series - Building Makemore Part 3 - Activations, Gradients, BatchNorm
- Karpathy Series - Building Makemore Part 4 - Becoming a Backprop Ninja
- Karpathy Series - Building Makemore Part 5 - Wavenet
- Karpathy Series - Intro to Large Language Models
- Karpathy Series - Deep Dive into LLMs like ChatGPT
- Karpathy Series - How I use LLMs
- Karpathy Series - Let’s reproduce GPT-2
- Karpathy Series - Let’s build the GPT Tokenizer
- Karpathy Series - Let’s build GPT from scratch
Agentic AI
- Agent 01 - Intro to Large Language Models by Andrej Karpathy
- Agent 02 - Building Large Language Models by Stanford CS229
- Agent 03 - Stanford Webinar - Agentic AI - A Progression of Language Model Usage
- Agent 04 - Building and Evaluating AI Agents by Sayash Kapoor (AI Snake Oil)
- Agent 05 - How We Build Effective Agents by Barry Zhang (Anthropic)
- Agent 06 - Building Agents with MCP by Mahesh Murag (Anthropic)
- Agent 07 - Building an Agent from Scratch by AI Engineer
- Agent 08 - Building Game Simulation Agents
Stanford CS236 - Deep Generative Models
- Lecture 1 - Introduction
- Lecture 2 - Background
- Lecture 3 - Autoregressive Models
- Lecture 4 - Maximum Likelihood Learning
- Lecture 5 - VAEs
- Lecture 6 - VAEs - Part 2
- Lecture 7 - Normalizing Flows
- Lecture 8 - Normalizing Flows - Part 2
- Lecture 9 - GANs
- Lecture 10 - GANs - Part 2
- Lecture 11 - Energy-based Models
- Lecture 12 - Energy-based Models - Part 2
- Lecture 13 - Score-based Models
- Lecture 14 - Score-based Models - Part 2
- Lecture 15 - Evaluation of Generative Models
- Lecture 16 - Score Based Diffusion Models
- Lecture 17 - Discrete Latent Variable Models
- Lecture 18 - Diffusion Models for Discrete Data
MIT 6.S184 - Flow Matching and Diffusion Models
- MIT 6.S184 Lecture 1 - Generative AI with SDEs
- MIT 6.S184 Lecture 2 - Constructing a Training Target
- MIT 6.S184 Lecture 3 - Training Flow and Diffusion Models
- MIT 6.S184 Lecture 4 - Building an Image Generator
- MIT 6.S184 Lecture 5 - Diffusion for Robotics
- MIT 6.S184 Lecture 6 - Diffusion for Protein Generation
Pipeline
The pipeline is designed to be as automated as possible. It is composed of the following steps:
- Get the transcript of the lecture using a YouTube transcript tool. I use the free tier from tactiq.io (https://tactiq.io/tools/youtube-transcript). (Update on Dec 17, 2025: I found that tactiq.io actually uses the YouTube Transcript API under the hood to get the transcript, so I switched to using that directly, making the pipeline fully automated with the YouTube URL as the only input.)
- Use the OpenAI API (gpt-5-mini model) to summarize the transcript, outputting a list of key discussions with detailed explanations and the timestamps associated with each discussion.
- Use the OpenAI API (gpt-5-mini model) to refine the detailed explanations into a blog-post-friendly format (highlighting, itemizing, etc.).
- With the timestamps and the YouTube video URL, capture the relevant frames from the lecture video to illustrate the key discussions. This is done with the `youtube-dl` tool to download the video (only the specific segments around each timestamp) and the `ffmpeg` tool to extract the frames.
- Put all the content and frames together and generate the blog post following the al-folio blog post format.
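The frame-capture step above can be sketched as follows. This is a minimal illustration, not the actual pipeline code: it assumes timestamps come in `HH:MM:SS` (or `MM:SS`) form, uses `yt-dlp` (the maintained fork of `youtube-dl`, whose `--download-sections` flag supports segment-only downloads) and `ffmpeg`'s `fps` filter, and the function names and padding default are my own invention.

```python
import shlex


def ts_to_seconds(ts: str) -> int:
    """Convert a 'HH:MM:SS' or 'MM:SS' timestamp into seconds."""
    secs = 0
    for part in ts.split(":"):
        secs = secs * 60 + int(part)
    return secs


def frame_capture_commands(url: str, ts: str, out_prefix: str, pad: int = 5):
    """Build the two shell commands for one timestamp:
    1) download only the segment around `ts` (yt-dlp --download-sections),
    2) extract one frame per second from that clip with ffmpeg.
    Returns (download_cmd, extract_cmd) as strings.
    """
    center = ts_to_seconds(ts)
    start, end = max(center - pad, 0), center + pad
    clip = f"{out_prefix}.mp4"
    section = f"*{start}-{end}"  # yt-dlp section spec, in seconds
    download = (
        f"yt-dlp --download-sections {shlex.quote(section)} "
        f"-o {shlex.quote(clip)} {shlex.quote(url)}"
    )
    extract = (
        f"ffmpeg -i {shlex.quote(clip)} -vf fps=1 "
        f"{shlex.quote(out_prefix)}_%03d.png"
    )
    return download, extract
```

For example, `frame_capture_commands("https://youtu.be/abc", "01:23:45", "lec1")` yields a `yt-dlp` command that downloads only the ten seconds around 1:23:45 and an `ffmpeg` command that dumps `lec1_001.png`, `lec1_002.png`, and so on; in the real pipeline these would be run via `subprocess` for each timestamp returned by the summarization step.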
Enjoy Reading This Article?
Here are some more articles you might like to read next: