👋 Hi, all!

I am Shichao Song, a third-year PhD student. My current research focuses on foundation models. I blog about my research and life.
Multi-armed bandit

Fully Annotated Guide to "The Multi-Armed Bandit Problem and Its Solutions"

The multi-armed bandit problem is a classic exploration–exploitation dilemma in reinforcement learning. Lilian Weng’s post is an excellent introduction, but some mathematical details and motivations can be cryptic. This article annotates it with step-by-step explanations and supplementary notes.

 · 16 min · 3235 words · Shichao Song
The Ouroboros Process (by [Google Gemini](https://gemini.google.com/))

Product Requirements Document of Ouroboros

An agentic DOM workspace where an LLM has full read/write/delete privileges over its own source code and visual interface.

 · Updated:  · 12 min · 2502 words · Shichao Song, Qingchen Yu, Huayi Lai, Xiaonan Zhang
The Great Decoupling (by [Google Gemini](https://gemini.google.com/))

Everything is Fleeting: A Roadmap to the Post-Labor AI Economy

A speculative roadmap of how AI will dismantle the labor market and what comes after.

 · 12 min · 2511 words · Shichao Song
快速的 AI 时代和温暖的人心([Google Gemini](https://gemini.google.com/) 生成)

我目前知道的

关于 AI 和人的一个简单技术人文讨论

 · Updated:  · 3 min · 1296 words · Shichao Song
Diagram of the probability transition process in speculative sampling.

How is the Speculative Decoding Algorithm Constructed?

A simple mathematical derivation of the algorithm construction process from the paper “Fast Inference from Transformers via Speculative Decoding”.

 · Updated:  · 5 min · 884 words · Shichao Song