Today I Learned
A collection of daily discoveries, insights, and notes from building AI infrastructure. Each entry captures something new learned along the journey.
Saturday, February 28, 2026
Nano-RL and Nano-Coder: Producing and Consuming Agentic Models
Starting two complementary projects: nano-RL for training agentic models with reinforcement learning, and nano-coder for building agent systems that consume these models
Nano-Train: From Minimal Training Loop to Distributed Training Framework
Journey from building a basic training loop to understanding modern LLM training framework architecture with monitoring, model inspection, and distributed parallelism
Wednesday, February 12, 2025
SGLang Model Support: Native DeepSeek-V2 Mapping and Transformers Backend Fallback
Learned how SGLang supports models like GLM-5 via native DeepSeek-V2 inheritance, and falls back to Transformers backend for unsupported models with hot-path replacement
Building Production LLM Infrastructure on Kubernetes
Learned how to build production-grade Kubernetes infrastructure for LLM serving with custom CRDs, operators, and fleet autoscaling patterns