posts

Notes and writeups on mathematics and machine learning.

Improving one small model: a deep look at depth-recurrence in 10-minute pretraining

We take one depth-recurrent language model from a 10-minute pretraining competition and try three ways to improve it. Two fail cleanly; the third, learning a mixing rule and then freezing it, ships.

12 min read · June 22, 2026

2026 · pretraining depth-recurrence language-models parameter-golf · machine-learning
GRPO, SFT, and teaching reasoning through arithmetic

What GRPO and SFT can and cannot teach a 3B model about arithmetic reasoning, measured on the Countdown task.

14 min read · June 20, 2026

2026 · reinforcement-learning, reasoning, GRPO, SFT · machine-learning
Parameter Golf: Six Weeks to Build the Best LLM

An account of OpenAI's Parameter Golf competition. Six weeks, two thousand pull requests, and a 14% compression improvement wrung from the same hardware, the same data, and the same ten minutes of training.

26 min read · May 08, 2026

2026 · language-models pretraining parameter-golf competition · machine-learning
From one 11 to another in four dimensions

1 min read · December 02, 2024

2024 · Mathematical physics, four-manifolds
RG flow of 4D gauge theory

4 min read · December 02, 2024

2024 · High energy physics, gauge theory