Anatomy of Trillion-Parameter Switchboards: Understanding Feedforward Blocks

Exploring the hidden layers of trillion-parameter switchboards: Feedforward Neural Networks and Activation Functions.

January 10, 2026 · 5 min

The Geometry of Meaning: Sine, ALiBi, RoPE, and HoPE

From sinusoidal positional encodings to RoPE and HoPE: how Transformers learn to handle word order and sequence length.

January 9, 2026 · 7 min