👋 Hi, all!
I am Shichao Song, a third-year PhD student. My current research focuses on foundation models. I blog about my research and life.
The multi-armed bandit problem is a classic exploration–exploitation dilemma in reinforcement learning. Lilian Weng’s post is an excellent introduction, but some mathematical details and motivations can be cryptic. This article annotates it with step-by-step explanations and supplementary notes.
An agentic DOM workspace where an LLM has full read/write/delete privileges over its own source code and visual interface.
A speculative roadmap of how AI will dismantle the labor market and what comes after.
关于 AI 和人的一个简单技术人文讨论
A simple mathematical derivation of the algorithm construction process from the paper “Fast Inference from Transformers via Speculative Decoding”.